INTRODUCTORY 
QUANTUM MECHANICS 



PRENTICE-HALL PHYSICS SERIES 



INTRODUCTORY 
QUANTUM MECHANICS 


BY 

VLADIMIR ROJANSKY 

PROFESSOR OF PHYSICS
UNION COLLEGE


New York 

PRENTICE-HALL, INC. 




COPYRIGHT, 1938, BY

PRENTICE-HALL, INC.

70 Fifth Avenue, New York

ALL RIGHTS RESERVED. NO PART OF THIS BOOK
MAY BE REPRODUCED IN ANY FORM, BY MIMEOGRAPH
OR ANY OTHER MEANS, WITHOUT PERMISSION
IN WRITING FROM THE PUBLISHERS


First Printing, October 1938 
Second Printing, April 1942 
Third Printing, January 1946 
Fourth Printing, September 1946 
Fifth Printing, November 1947
Sixth Printing, January 1949 


PRINTED IN THE UNITED STATES OF AMERICA 



PREFACE 


This book is addressed to the reader who wishes to become 
familiar with some of the simpler physical ideas and mathematical 
methods of quantum mechanics, either because of their own 
intrinsic interest, or in preparation for a comprehensive and 
critical survey of the theory, or for a study of its applications. 
Although written primarily for graduate and advanced under- 
graduate students beginning a study of quantum mechanics, the 
text should also be of service to persons who are already acquainted 
with some of the concepts and results of the theory, and who wish 
to inquire into its formulation. 

The prerequisites are the elements of calculus and of ordinary
differential equations, and a recognition of the failure of classical
mechanics in the domain of atomic physics. The reader's
acquaintance with partial differential equations and with matrices
is assumed to be slight at best.

Quantum mechanics is a large subject, and even introductory
courses in it differ greatly in content and emphasis. Certain
rudimentary topics, however, come up in almost every quantum
mechanics course; and it is to some of these topics that this book is
confined. I hope that its plan will be found convenient for the
interpolation and addition of further material in the classroom.

The discussion proceeds from the outset toward a coherent
view of the main mathematical forms of the theory; but the
exposition makes little pretense to rigor and often aims to make
plausible rather than to prove. The arrangement of the book is
essentially as follows:


                                                                 CHAPTERS
Mathematical topics                                              I and IX
Problems in one-dimensional motion:
    the classical method                                         II
    the Schroedinger (wave) method                               III-VII
    the momentum method                                          VIII
    the Heisenberg (matrix) method                               X
    the Dirac (symbolic) method                                  XI
Problems in three-dimensional motion; the Schroedinger method    XII
Pauli's theory of electron spin                                  XIII
Dirac's theory of the electron                                   XIV


The reader interested primarily in the Schroedinger method
may take up Chapter XII directly after Chapter VII; §§53 to 58
of Chapter IX and §69 of Chapter X will then prepare him for the
two chapters on electron spin.





The exercises provide drill in the material already covered, 
tie up some of the loose ends in the discussion of the text, and 
clear the ground for the topics to come. Most of the exercises 
are short, and a good understanding of the text should enable the
reader to do many of them mentally. 

The main results and proofs presented here are of course well 
known to theoretical physicists, while most of the supplementary 
arguments can perhaps also be found elsewhere, between the lines 
if not in print. The courses of Professor J. H. Van Vleck, Profes- 
sor E. U. Condon, Professor P. A. M. Dirac, and Professor H. P. 
Robertson, various lectures by others, a few letters, and many 
conversations have all put their imprint on these pages. Through 
their reactions to different ways of presenting various topics, the 
persons who attended my own classes have also played a large 
part in shaping this book. The entire manuscript was thoroughly 
improved by my colleague Professor A. H. Fox, and further 
important refinements were made by Dr. Condon, Editor of the 
Prentice-Hall Physics Series. In the handling of the manuscript
and the preparation of the diagrams I had student assistance
sponsored by the N. Y. A. To the editorial department of
Prentice-Hall, Inc. I am indebted for sympathetic cooperation.

V. Rojansky



CONTENTS 

I. Mathematical Preliminaries
    1. Introduction
    2. Differential operators
    3. The operator $-d^2/dx^2$
    4. The operator $-d^2/dx^2 + x^2$
    5. The operators $-i\,d/dx$, $x$, and $c$
    6. Simultaneous eigenfunctions
    7. Degenerate eigenvalues
    8. Orthogonal sets of functions

II. Remarks on Classical Mechanics
    9. Hamiltonian functions
    10. Examples of classical motion
    11. Distribution functions
    12. Probability packets

III. Elements of the Schroedinger Method; One Dimension
    13. Two assumptions
    14. Schroedinger operators
    15. Energy levels of an oscillator
    16. Schroedinger functions
    17. Specification of states
    18. Energy states of an oscillator
    19. Composition of states
    20. Resolution of states
    21. Probability packets
    22. Coherent and incoherent superposition of states
    23. Heisenberg inequalities
    24. The Heisenberg principle
    25. Kennard packets
    26. Summary

IV. Further Problems in the Schroedinger Method
    27. Linear momentum
    28. The free particle in one dimension


    29. One-dimensional rectangular potential wells
    30. The one-dimensional box
    31. Radiative transitions
    32. Properties of the one-dimensional first Schroedinger equation

V. Approximate Methods for Treating the One-Dimensional Schroedinger Equation
    33. Numerical approximations
    34. Expansions in powers of h
    35. A variation method
    36. A perturbation method for nondegenerate states

VI. One-Dimensional Probability Currents and De Broglie Waves
    37. Probability current
    38. One-dimensional potential barriers
    39. Virtual binding
    40. Waves
    41. De Broglie waves
    42. A vibrating string and a particle in a box

VII. Constants of Motion
    43. Time derivatives of averages of dynamical variables
    44. Time derivatives of dynamical variables
    45. Constants of motion
    46. Dirac's expansion theorem
    47. Dynamical variables having no classical analogues
    48. Constants of motion having no classical analogues
    49. Potential lattices and energy bands

VIII. Remarks on the Momentum Method
    50. The framework of the momentum method
    51. Energy levels of an oscillator
    52. Concluding remarks

IX. Linear Operators and Matrices
    53. Mappings of a plane
    54. Representation of mappings by means of matrices




    55. Different coordinate frames may yield different representatives for a given operator
    56. Matrices and vectors
    57. N-dimensional spaces
    58. Further remarks on matrices
    59. Function space
    60. Representation of differential operators by means of matrices
    61. Concluding remarks

X. Elements of the Heisenberg Method
    62. Heisenberg matrices
    63. Heisenberg matrices (continued)
    64. The Heisenberg method
    65. Energy levels of an oscillator
    66. Diagonalization of matrices
    67. Approximate diagonalization of matrices
    68. The matrix perturbation method for nondegenerate states
    69. More matrix algebra
    70. The Heisenberg representation of states

XI. Remarks on the Symbolic Method
    71. Symbolic operators and operands
    72. Energy levels of an oscillator
    73. The symbolic $\psi$'s

XII. Problems in Three-Dimensional Motion; the Schroedinger Method
    74. Mathematical preliminaries
    75. Remarks on classical mechanics
    76. Elements of the Schroedinger method
    77. Elements of the Schroedinger method (continued)
    78. Central fields
    79. The free particle; polar quantization
    80. The Bohr formula for the hydrogen atom; fixed nucleus
    81. The Bohr formula for the hydrogen atom; free nucleus
    82. The fine structure of the hydrogen levels
    83. Relativity corrections for hydrogen




XIII. Elements of Pauli's Theory of Electron Spin
    84. The electron-spin hypothesis
    85. Possible values of components of angular momenta
    86. The Pauli operators and operands
    87. The simultaneous eigen-$\psi$'s of $J_z$ and $J^2$
    88. Spin-orbit interaction and the spin correction
    89. The spin-and-relativity correction for hydrogen
    90. Space functions and spin functions

XIV. Elements of Dirac's Theory of the Electron
    91. The Dirac equation for an electron in an electrostatic field
    92. The Dirac $\alpha$'s
    93. The existence of a 'spin' angular momentum
    94. Constants of motion for a central field
    95. The radial equations
    96. The energy levels of the hydrogen atom
    97. The normal state of the hydrogen atom

Appendix

Index

Note: The numbering of equations, exercises, tables, and figures is
described on page 27 (footnote), page 29 (footnote), and page 74.

Latin and Greek symbols are listed in the index, the latter in the
alphabetical order of their English names.

Mathematical symbols not admitting of alphabetical listing are grouped
in the index under the heading 'symbols.'






Chapter 1 

MATHEMATICAL PRELIMINARIES 
1. Introduction 

In 1900 Max Planck introduced the idea of a quantum of 
action. In 1905 Albert Einstein suggested the wave-particle 
duality of radiation. In 1913 Niels Bohr formulated the first 
substantially successful theory of atomic phenomena. In 1923 
Louis de Broglie injected the wave-particle duality into the theory 
of matter. And a quarter of a century after Planck's discovery
came quantum mechanics. 

Quantum mechanics was invented twice: first by Werner
Heisenberg, and a few months later by Erwin Schroedinger. But
Heisenberg's matrix mechanics and Schroedinger's wave mechanics
were so different that for a time their mathematical equivalence
remained unrecognized.¹

Then began the widespread and remarkably successful activity 
in developing the new mechanics for application to a growing 
range of problems in physics and chemistry. As the theory be- 
came better understood, generalized and clarifying formulations 
of it were put forward, notably by Paul Adrien Maurice Dirac 
and by Pascual Jordan. A relativistic quantum mechanics of the 
electron was constructed by Dirac. A mathematically rigorous
formulation of the theory was given by Johann von Neumann. 
And, as a result of all these developments, quantum mechanics 
is today not only much wider in scope than it was at its inception, 
but also more compact, more orderly, and easier to learn. 

The mathematics required for the quantum-mechanical treatment
of the motion of a particle is more elaborate than that
necessary for the classical treatment of particle motion, so that a
person beginning the study of the quantum-mechanical behavior
of a particle may find his mathematical preparation quite
inadequate, even though he is well versed in the corresponding classical
problem. For this reason, whenever such background cannot be
expected to have been acquired in connection with a study of
classical particle motion, we shall make it a part of our aim to
develop the mathematical background necessary for the work.
But the purely mathematical discussion will be restricted to the
necessary minimum, and will often forego mathematical rigor.

¹ For a vivid account of the early history of quantum mechanics see
J. H. Van Vleck's reports in Jour. Optical Soc. Am. and Rev. Sci. Instr.,
14, 108 (1927), and 16, 301 (1928).

In order that the reader may not be confronted concurrently
with two quite distinct tasks (that of becoming acquainted with
certain standard mathematical methods, and that of learning the
use and the physical significance of these methods in quantum
mechanics), we devote this chapter to an elementary account of
the mathematical methods involved in the physical considerations
of Chapter III, where quantum-mechanical ideas will first be
introduced.


2. Differential Operators 


A central part in the mathematical structure of quantum me- 
chanics is played by operators of several types; of these we shall 
now consider the so-called differential operators, which are used 
in the theory of ordinary differential equations. 

Among the various mathematical operations that can be performed
on a function $u(x)$ of the variable $x$ [such as $u(x) = x$
and $u(x) = \sin x$] are differentiation of $u(x)$ with respect to $x$,
multiplication of $u(x)$ by $x$, and multiplication of $u(x)$ by a
constant, say $c$. In studying the properties of these and other
operations it is convenient to introduce the concepts of an operator
and an operand. It is convenient, for example, to regard the
expression $\frac{d}{dx}u(x)$ as consisting of two constituents: the operator
$\left[\frac{d}{dx}\right]$ and the operand $u(x)$; to regard the expression $xu(x)$ as
consisting of the operator $[x]$ and the operand $u(x)$; and to regard
the expression $cu(x)$ as consisting of the operator $[c]$ and the
operand $u(x)$. The operator $\left[\frac{d}{dx}\right]$ is, in fact, defined by the
requirement that the relation

$$\left[\frac{d}{dx}\right]u(x) = u'(x) \tag{1}$$

hold for an arbitrary differentiable function $u(x)$, $u'(x)$ being the
first derivative of $u(x)$ obtained by the rules of differential
calculus; the operator $[x]$ is defined by the requirement that

$$[x]u(x) = xu(x) \tag{2}$$

for an arbitrary $u(x)$, the right side of (2) being the algebraic
product of $x$ and $u(x)$; and the operator $[c]$ is defined by the
requirement that

$$[c]u(x) = cu(x) \tag{3}$$

for an arbitrary $u(x)$, the right side of (3) being the algebraic
product of $u(x)$ and the constant $c$.

We shall usually denote the operators $\left[\frac{d}{dx}\right]$, $[x]$, and $[c]$ by
$\frac{d}{dx}$ (or $d/dx$), $x$, and $c$, respectively; brackets were used above in
order that Equations (1) to (3) might not appear trivial. The
operator symbols $d/dx$, $x$, and $c$ must not be confused with symbols
representing ordinary functions and numbers. This is perhaps
clear at once in the case of the operator $d/dx$; that this is also so
in the case of the operators $x$ and $c$ will become clear later. The
reader should acquire the habit of inquiring whether a mathematical
expression that he encounters in a quantum-mechanical
calculation has to do with operators, or with ordinary functions
and numbers, or with operators and ordinary functions and
numbers.

The processes of operating with the operators $d/dx$, $x$, and $c$
are called, respectively, differentiation with respect to $x$,
multiplication by $x$, and multiplication by the constant $c$. Among
the operators of type $c$ are the operator 1 (the unit operator,
idemfactor, or simply unity), which leaves every operand unaltered,
and the operator 0 (the zero operator or simply zero), which
annihilates every operand. Operators of type $c$ are often called
constants. When discussing operators in general, we usually
denote them by Greek letters.

If the respective results of operating on the operand $u$ with
the operators $\alpha$ and $\beta$ are added together, then the final result

$$\alpha u + \beta u \tag{4}$$

is usually denoted by

$$(\alpha + \beta)u, \tag{5}$$

this notation being suggested by the notation of algebra; thus
the expression $(d/dx)u(x) + xu(x)$, that is, the sum of the respective
results of operating on $u(x)$ with $d/dx$ and with $x$, is written
as $(d/dx + x)u(x)$. This notation is readily extended to sums
of the respective results of operating on the same operand with
more than two operators.

Another convention concerns consecutive operations. If we
first operate on $u$ with the operator $\alpha$, getting the result $\alpha u$, and
next operate on this result with the operator $\beta$, then the final
result

$$\beta(\alpha u) \tag{6}$$

of the two operations is usually denoted by

$$\beta\alpha u, \tag{7}$$

it being understood that to evaluate an expression of type (7) we
must first operate on $u$ with $\alpha$, and next operate on the result so
obtained with $\beta$. For example, if we first operate on $u(x)$ with
the operator $x$, getting $xu(x)$, and next operate on this result
with $d/dx$, then the final result $(d/dx)[xu(x)]$ must be written in
the conventional notation as $(d/dx)xu(x)$; on the other hand, if
we first operate on $u(x)$ with $d/dx$, getting $(d/dx)u(x)$, and next
operate on this result with $x$, then the final result $x[(d/dx)u(x)]$
must be written as $x(d/dx)u(x)$. Incidentally, by the rule for
differentiating a product, $(d/dx)[xu(x)] = x(d/dx)u(x) + u(x)$,
that is,

$$\frac{d}{dx}xu(x) = x\frac{d}{dx}u(x) + u(x), \tag{8}$$

so that

$$\frac{d}{dx}xu(x) \neq x\frac{d}{dx}u(x) \tag{9}$$

unless $u(x) = 0$. The inequality (9) illustrates the importance
of remembering the order in which the individual operations must
be carried out when the conventional notation is used for successive
operations. Extending this notation to cases involving
more than two consecutive operations, we write $\gamma\beta\alpha u$ for $\gamma[\beta(\alpha u)]$,
$\beta\alpha\gamma u$ for $\beta[\alpha(\gamma u)]$, $\delta\gamma\beta\alpha u$ for $\delta\{\gamma[\beta(\alpha u)]\}$, and so forth; if a certain
operation is repeated without the intervention of other operations,
we use the exponent notation and write, for example,
$\alpha^2 u$ for $\alpha\alpha u$.
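The order rule expressed by (8) and (9) can be checked mechanically on a simple operand. The following Python sketch is only an illustration under stated assumptions: operands are restricted to polynomials stored as coefficient lists, and the helper names `ddx`, `times_x`, and `add` are hypothetical, introduced here for the purpose.

```python
# Illustrative sketch: a polynomial a0 + a1*x + a2*x**2 + ... is stored
# as the coefficient list [a0, a1, a2, ...]. All names are hypothetical.

def ddx(u):
    """Operate with d/dx: differentiate the polynomial u."""
    return [k * u[k] for k in range(1, len(u))] or [0]

def times_x(u):
    """Operate with x: multiply the polynomial u by x."""
    return [0] + list(u)

def add(u, v):
    """Add two polynomials of possibly different lengths."""
    n = max(len(u), len(v))
    u = list(u) + [0] * (n - len(u))
    v = list(v) + [0] * (n - len(v))
    return [a + b for a, b in zip(u, v)]

u = [5, 0, 3]                          # the operand u(x) = 5 + 3x^2

ddx_x_u = ddx(times_x(u))              # (d/dx) x u: x acts first, then d/dx
x_ddx_u = times_x(ddx(u))              # x (d/dx) u: d/dx acts first, then x

print(ddx_x_u)                         # coefficients of (d/dx) x u
print(x_ddx_u)                         # coefficients of x (d/dx) u
print(ddx_x_u == add(x_ddx_u, u))      # relation (8) for this operand
```

The two coefficient lists differ by the operand itself, which is the content of (8); a check on one polynomial illustrates the relation but does not prove it.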

Operator algebra. In investigating the properties of a given
class of operators, we may proceed along two lines. One is to
study the results of operating with the operators on various
operands, and to obtain the desired information through a study
of the relations among the results of the operations. The other is
to introduce the concept of certain relations among the operators
themselves, to define (using operands) the criteria for the existence
of such relations, to establish (using operands) the existence of
certain fundamental relations among some of the operators of the
class, and then, by deducing additional relations among the
operators from the fundamental ones, to continue the study without
further reference to operands. The latter method, that of
operator algebra, rests on the following definitions of equality,
sum, and product of operators.

If the operators $\alpha$ and $\beta$ are of such a nature that

$$\alpha u = \beta u \tag{10}$$

for every operand $u$ on which both can operate, then $\alpha$ and $\beta$
are defined to be equal to each other, and we write

$$\alpha = \beta, \tag{11}$$

following the notation of ordinary algebra. We need not consider
here the possibility of existence of operands on which one
of the operators can operate and the other cannot.

If the operators $\alpha$, $\beta$, and $\gamma$ are of such a nature that

$$\gamma u = \alpha u + \beta u \tag{12}$$

for every operand $u$ for which the operations involved in (12)
can be carried out, then $\gamma$ is defined to be the sum of $\alpha$ and $\beta$,
and we write

$$\gamma = \alpha + \beta. \tag{13}$$

In particular, we may regard the symbol $\alpha + \beta$ in $(\alpha + \beta)u$ as a
distinct operator, the sum of $\alpha$ and $\beta$. For example, $d/dx + x$
in $(d/dx + x)u(x)$ may be regarded as a distinct operator, defined
as the sum of $d/dx$ and $x$, and having properties of its own.

If the operators $\alpha$, $\beta$, and $\gamma$ are of such a nature that

$$\gamma u = \alpha(\beta u) \tag{14}$$

for every operand $u$ for which the operations in (14) can be
carried out, then $\gamma$ is defined to be the product $\alpha\beta$ of $\alpha$ and $\beta$,
and we write

$$\gamma = \alpha\beta. \tag{15}$$

In particular, we may regard $\alpha\beta$ in $\alpha\beta u$ as a distinct operator, the
product $\alpha\beta$ of $\alpha$ and $\beta$. For example, $(d/dx)x$ in $(d/dx)xu(x)$
may be regarded as a distinct operator. Incidentally, the operator
$(d/dx)x$ may serve to illustrate the importance of not confusing
the operator $x$ with the ordinary function $x$: the operator symbol
$(d/dx)x$ must be understood to symbolize the double operation of
first multiplying an operand by $x$ and then differentiating the
result so obtained with respect to $x$; no numerical value must be
ascribed to this symbol, even though the symbol, when no operand
is written after it, may be mistaken for the derivative of the
function $x$ with respect to $x$.
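The definitions of sum and product of operators lend themselves to a short computational sketch. The Python below is an illustration only, assuming polynomial operands stored as coefficient lists; the class `Op` and the helpers `trim` and `poly_add` are hypothetical names chosen here and carry no authority beyond this example.

```python
# Illustrative sketch; the class and helper names are hypothetical.
# Operands are polynomials stored as coefficient lists [a0, a1, a2, ...].

def trim(u):
    """Drop trailing zero coefficients so equal polynomials compare equal."""
    u = list(u)
    while len(u) > 1 and u[-1] == 0:
        u.pop()
    return u

def poly_add(u, v):
    """Add two polynomials of possibly different lengths."""
    n = max(len(u), len(v))
    u = list(u) + [0] * (n - len(u))
    v = list(v) + [0] * (n - len(v))
    return trim([a + b for a, b in zip(u, v)])

class Op:
    """An operator, represented by its action on an operand."""

    def __init__(self, action):
        self.action = action

    def __call__(self, u):
        return trim(self.action(u))

    def __add__(self, other):
        # Sum, as in (12) and (13): (alpha + beta)u = alpha u + beta u.
        return Op(lambda u: poly_add(self(u), other(u)))

    def __mul__(self, other):
        # Product, as in (14) and (15): (alpha beta)u = alpha(beta u);
        # the right-hand factor operates first.
        return Op(lambda u: self(other(u)))

D = Op(lambda u: [k * u[k] for k in range(1, len(u))] or [0])   # d/dx
X = Op(lambda u: [0] + list(u))                                  # x

u = [1, 2, 0, 4]               # the operand u(x) = 1 + 2x + 4x^3

print((D + X)(u))              # (d/dx + x)u
print((D * X)(u))              # (d/dx)x u
print((X * D)(u))              # x(d/dx) u
```

Writing the product as `D * X` mirrors the convention of the text: the factor standing next to the operand acts first, so `D * X` and `X * D` are built from the same pieces but are different operators.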

Note that $\gamma$ in (14) was defined as the product $\alpha\beta$ of $\alpha$ and $\beta$,
rather than simply the product of $\alpha$ and $\beta$. The reason for this
is that the product $\beta\alpha$ may be different from the product $\alpha\beta$;
for example, it follows from (9) that

$$\frac{d}{dx}x \neq x\frac{d}{dx}. \tag{16}$$

The definitions of equality, sum, and product of operators given
above enable us to construct any number of distinct operators
from our three fundamental operators $d/dx$, $x$, and $c$, such as
$d/dx + x$, $d/dx + x + 3$, $x^2$, $3\,d/dx$, $-d/dx$, $(d/dx)^2$,
$(d/dx)^3 + (d/dx + x)^2$, and so forth; for lack of a more descriptive short
term, we shall call these operators differential operators. The
operator-algebraic rules for handling differential operators are
quite similar to those of ordinary algebra, the conspicuous exception
being, however, that the order of the factors in a product is,
in general, important. An algebra in which the order of the
factors may affect the significance of a product is said to be
noncommutative. The algebra of the differential operators is
noncommutative, as illustrated by (16). If the products $\alpha\beta$ and $\beta\alpha$
of two operators $\alpha$ and $\beta$ are equal, we say that $\alpha$ and $\beta$ commute;
otherwise we say that they do not commute. We may add equal
operators to both sides of an operator equation without destroying
the equality; but if the algebra is noncommutative we must
be careful in multiplying an equation through by an operator;
for example, if $\alpha = \beta$, then $\gamma\alpha = \gamma\beta$, and $\alpha\gamma = \beta\gamma$, but $\alpha\gamma$ does
not necessarily equal $\gamma\beta$, nor does $\gamma\alpha$ necessarily equal $\beta\gamma$. The
process of getting $\gamma\alpha = \gamma\beta$ from $\alpha = \beta$ is called multiplication by $\gamma$
from the left; that of getting $\alpha\gamma = \beta\gamma$ from $\alpha = \beta$ is called
multiplication by $\gamma$ from the right.

Commutators. The operator $\alpha\beta - \beta\alpha$ is called the commutator
of the operators $\alpha$ and $\beta$. The commutator of two operators may
sometimes be identified with an operator having a simpler form
than the defining expression $\alpha\beta - \beta\alpha$. For example, the
commutator

$$\frac{d}{dx}x - x\frac{d}{dx} \tag{17}$$

of the operators $d/dx$ and $x$ can be put into a simpler form as
follows: we write (8) as

$$\frac{d}{dx}xu(x) = \left(x\frac{d}{dx} + 1\right)u(x); \tag{18}$$

since (18) holds for an arbitrary differentiable operand $u(x)$, we
have

$$\frac{d}{dx}x = x\frac{d}{dx} + 1, \tag{19}$$

that is,

$$\frac{d}{dx}x - x\frac{d}{dx} = 1. \tag{20}$$

The commutator of $d/dx$ and $x$ is thus the unit operator. Similarly,
the commutator of $x$ and $d/dx$ is $-1$.

The commutator of $d/dx$ and $x^2$ can be simplified in appearance
as follows: By the rules of the calculus,

$$\left(\frac{d}{dx}x^2 - x^2\frac{d}{dx}\right)u(x) = \frac{d}{dx}x^2u(x) - x^2\frac{d}{dx}u(x)
= 2xu(x) + x^2\frac{d}{dx}u(x) - x^2\frac{d}{dx}u(x) = 2xu(x); \tag{21}$$

hence the equation

$$\left(\frac{d}{dx}x^2 - x^2\frac{d}{dx}\right)u(x) = 2xu(x) \tag{22}$$

holds for an arbitrary differentiable $u(x)$, and we have

$$\frac{d}{dx}x^2 - x^2\frac{d}{dx} = 2x. \tag{23}$$

The commutators of pairs of more complicated operators can
often be simplified in appearance in a similar manner. The
commutator of two commuting operators is, of course, the
operator zero.

Let us now illustrate the symbolic methods of operator algebra
by deriving (23) from (20) without further reference to operands.
Multiplication of (20) by $x$ from the left yields

$$x\frac{d}{dx}x - x^2\frac{d}{dx} = x, \tag{24}$$

while multiplication of (20) by $x$ from the right gives

$$\frac{d}{dx}x^2 - x\frac{d}{dx}x = x; \tag{25}$$

adding (24) and (25), we now get

$$\frac{d}{dx}x^2 - x^2\frac{d}{dx} = 2x, \tag{26}$$

in agreement with (23).
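The commutator relations (20) and (23), and the step of adding (24) and (25), can be spot-checked numerically. The sketch below is an illustration under assumptions: operands are polynomials stored as coefficient lists, every helper name (`combine`, `x_power`, `commutator`, `agree`) is hypothetical, and agreement on a handful of test operands is evidence, not a proof.

```python
# Illustrative sketch with hypothetical helper names. Operands are
# polynomials stored as coefficient lists [a0, a1, a2, ...].

def trim(u):
    """Drop trailing zero coefficients so equal polynomials compare equal."""
    u = list(u)
    while len(u) > 1 and u[-1] == 0:
        u.pop()
    return u

def combine(u, v, sign):
    """Return the polynomial u + sign*v."""
    m = max(len(u), len(v))
    u = list(u) + [0] * (m - len(u))
    v = list(v) + [0] * (m - len(v))
    return trim([a + sign * b for a, b in zip(u, v)])

def scale(c, u):
    """Multiply the polynomial u by the constant c."""
    return trim([c * a for a in u])

def D(u):
    """The operator d/dx."""
    return trim([k * u[k] for k in range(1, len(u))] or [0])

def x_power(n):
    """Return the operator x^n (multiplication by x, repeated n times)."""
    return lambda u: trim([0] * n + list(u))

def commutator(a, b):
    """Return the operator ab - ba."""
    return lambda u: combine(a(b(u)), b(a(u)), -1)

# A few test operands; agreement on these is a check, not a proof.
operands = [[1], [0, 1], [3, -1, 0, 7], [2, 0, 5, 0, 0, 1]]

def agree(op1, op2):
    """True if both operators give the same result on every test operand."""
    return all(op1(u) == op2(u) for u in operands)

X, X2 = x_power(1), x_power(2)
c = commutator(D, X)

unit_ok = agree(c, trim)                                        # (20)
square_ok = agree(commutator(D, X2), lambda u: scale(2, X(u)))  # (23)
# Adding (24) and (25): x(ab - ba) + (ab - ba)x with a = d/dx, b = x.
sum_ok = agree(commutator(D, X2),
               lambda u: combine(X(c(u)), c(X(u)), +1))         # (26)
# The pattern for higher powers of x (compare Exercise 6).
power_ok = all(
    agree(commutator(D, x_power(n)),
          lambda u, n=n: scale(n, x_power(n - 1)(u)))
    for n in range(1, 6)
)

print(unit_ok, square_ok, sum_ok, power_ok)
```

The third check follows the operator-algebra route of the text: it never differentiates $x^2u$ directly on the right-hand side, but builds the result from the commutator of $d/dx$ and $x$ multiplied by $x$ from the left and from the right.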

Linear operators. An operator $\alpha$ is said to be linear if for
arbitrary operands $u$ and $v$

$$\alpha(u + v) = \alpha u + \alpha v, \tag{27}$$

and for an arbitrary constant $c$

$$\alpha c = c\alpha. \tag{28}$$

The simple differential operators discussed above are examples of
linear operators. In quantum mechanics we are concerned almost
exclusively with linear operators, and throughout our work the
term operator will be used to mean linear operator, unless the
contrary is explicitly stated.
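Conditions (27) and (28) can be contrasted with an operation that violates them. The Python sketch below is an illustration only: operands are polynomials stored as coefficient lists, the squaring operation `square` is an example of a nonlinear operation chosen here (it does not appear in the text), and all function names are hypothetical.

```python
# Illustrative sketch with hypothetical names; polynomials are stored as
# coefficient lists [a0, a1, a2, ...].

def trim(u):
    """Drop trailing zero coefficients so equal polynomials compare equal."""
    u = list(u)
    while len(u) > 1 and u[-1] == 0:
        u.pop()
    return u

def add(u, v):
    """Add two polynomials of possibly different lengths."""
    n = max(len(u), len(v))
    u = list(u) + [0] * (n - len(u))
    v = list(v) + [0] * (n - len(v))
    return trim([a + b for a, b in zip(u, v)])

def scale(c, u):
    """Multiply the polynomial u by the constant c."""
    return trim([c * a for a in u])

def D(u):
    """The operator d/dx."""
    return trim([k * u[k] for k in range(1, len(u))] or [0])

def X(u):
    """The operator x."""
    return trim([0] + list(u))

def square(u):
    """Replace u by the product u*u; assumed here as a nonlinear example."""
    out = [0] * (2 * len(u) - 1)
    for i, a in enumerate(u):
        for j, b in enumerate(u):
            out[i + j] += a * b
    return trim(out)

def is_linear_on(op, u, v, c):
    """Check (27) and (28) for one pair of operands and one constant."""
    additive = op(add(u, v)) == add(op(u), op(v))        # (27)
    homogeneous = op(scale(c, u)) == scale(c, op(u))     # (28) applied to u
    return additive and homogeneous

u, v, c = [1, 2], [0, 0, 3], 5      # u = 1 + 2x, v = 3x^2, c = 5

print(is_linear_on(D, u, v, c))
print(is_linear_on(X, u, v, c))
print(is_linear_on(square, u, v, c))
```

A single failing case suffices to show that an operation is not linear, whereas a passing case for `D` or `X` only illustrates (27) and (28) for the operands tried.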





Exercises

1. Verify that the result of operating on the operand $\sin x$ with the
operator $[(d/dx)x]^2$ is $\sin x + 3x\cos x - x^2\sin x$, and that the result of
operating on the same operand with the operator $[x(d/dx)]^2$ is
$x\cos x - x^2\sin x$.

2. Verify that $(d/dx + 1)^2 = (d/dx)^2 + 2\,d/dx + 1$, and that
$(d/dx + x)^2 = (d/dx)^2 + (d/dx)x + x(d/dx) + x^2$. Note that $(d/dx + x)^2$
is not equal to $(d/dx)^2 + 2(d/dx)x + x^2$.

3. In ordinary algebra we have: $(\alpha + \beta)(\alpha - \beta) = \alpha^2 - \beta^2$. Under
what condition can this formula be used when $\alpha$ and $\beta$ are operators?
What formula should be used instead when this condition is not fulfilled?

4. Consider the products $\alpha\beta\gamma$, $\beta\alpha\gamma$, $\gamma\alpha\beta$, and $\gamma\beta\alpha$ of operators
$\alpha$, $\beta$, and $\gamma$. How many of these products are different operators if no
two of the three operators $\alpha$, $\beta$, and $\gamma$ commute? If $\alpha$ and $\beta$ commute
with each other, but neither $\alpha$ nor $\beta$ commutes with $\gamma$? If $\alpha$ commutes
with $\beta$ and with $\gamma$, but $\beta$ and $\gamma$ do not commute? If every two of the three
operators commute? Illustrate each of these possibilities, using operators
of the types discussed in the text.

5. Show that the commutator of the operator $x$ and the operator
$(\cos^2 a)x(d/dx) + (\sin^2 a)(d/dx)x$, where $a$ is a constant, is independent of $a$.
Show that the commutator of $d/dx$ and $(\cos^2 a)x(d/dx) + (\sin^2 a)(d/dx)x$
is also independent of $a$.

6. Show, using calculus, that for all positive integral values of $n$ we have:
$(d/dx)x^n - x^n(d/dx) = nx^{n-1}$.

7. Show by operator algebra that, if $\alpha\beta - \beta\alpha = 1$, then $\alpha\beta^2 - \beta^2\alpha = 2\beta$,
$\alpha\beta^3 - \beta^3\alpha = 3\beta^2$, and, in general, $\alpha\beta^n - \beta^n\alpha = n\beta^{n-1}$.

8. Derive the result of Exercise 6, taking for granted (20) and the result
of Exercise 7.

9. Reduce the commutator of $(d/dx)^2$ and $x$ to a form not involving the
operator $x$.

10. Verify the following identities, in which the Greek letters denote
arbitrary linear operators, $c$ denotes a constant, and $[\alpha, \beta]$ denotes
$\alpha\beta - \beta\alpha$:

$$
\begin{aligned}
[\xi, \eta] &= -[\eta, \xi] \\
[\xi, c] &= 0 \\
[\xi_1 + \xi_2, \eta] &= [\xi_1, \eta] + [\xi_2, \eta] \\
[\xi, \eta_1 + \eta_2] &= [\xi, \eta_1] + [\xi, \eta_2] \\
[\xi_1\xi_2, \eta] &= [\xi_1, \eta]\xi_2 + \xi_1[\xi_2, \eta] \\
[\xi, \eta_1\eta_2] &= [\xi, \eta_1]\eta_2 + \eta_1[\xi, \eta_2] \\
[\xi, [\eta, \zeta]] + [\eta, [\zeta, \xi]] + [\zeta, [\xi, \eta]] &= 0.
\end{aligned}
\tag{29a-g}
$$

11. Show that the operator $\partial/\partial x$, which stands for partial differentiation
of any operand (now explicitly allowed to be a function of several variables)
with respect to $x$, satisfies the equation

(30)

12. By a generalization of the arguments given below, prove the following
theorem for the case when the operand $u$ can be expanded into a power
series in the variable $x$: if the operator $\alpha$ commutes with $x$ and with $d/dx$,
that is, if

$$\alpha x - x\alpha = 0 \tag{31}$$

and

$$\alpha\frac{d}{dx} - \frac{d}{dx}\alpha = 0, \tag{32}$$

then, for an arbitrary $u$,

$$\alpha u = cu, \tag{33}$$

where $c$ is independent of $x$ and $d/dx$, so that, insofar as dependence on $x$
and $d/dx$ is concerned, $\alpha$ is a constant.²

[Let $u = u_1 = 1$; then, in view of (32), $(d/dx)\alpha u_1 = \alpha(d/dx)u_1 = 0$, and

$$\frac{d}{dx}(\alpha u_1) = 0, \tag{34}$$

so that $\alpha u_1 = c$, where $c$ is independent of $x$, or

$$\alpha u_1 = cu_1. \tag{35}$$

Next let $u = u_2 = x$, so that $\alpha u_2 = \alpha x$, or $\alpha u_2 = \alpha xu_1$, $\alpha xu_1$ being the
result of operating with $\alpha$ on the product of the variable $x$ and the number 1.
Now, we may regard $x$ in $\alpha xu_1$ as being the operator $x$ instead of the variable
$x$, and write $\alpha xu_1 = (\alpha x)u_1$ without affecting the result; but, in view of
(31), $(\alpha x)u_1 = x\alpha u_1$, so that, because of (35), we have $\alpha u_2 = xc$, and hence

$$\alpha u_2 = cu_2. \tag{36}$$

Thus (33) follows from (31) and (32) in the two special cases $u = 1$
and $u = x$.]

13. Show that, if $\alpha$ satisfies the two operator equations

$$x\alpha - \alpha x = 0, \qquad \frac{\partial}{\partial x}\alpha - \alpha\frac{\partial}{\partial x} = 2x, \tag{37}$$

then the operator $(\alpha - x^2)$ commutes with $x$ and with $\partial/\partial x$; then, using the
result of Exercise 12, show that the general solution of Equations (37) is

$$\alpha = x^2 + c, \tag{38}$$

where $c$ is a constant insofar as dependence on $x$ and $\partial/\partial x$ is concerned.
Show in a similar way that the general solution of the simultaneous operator
equations $x\alpha - \alpha x = 0$ and $(\partial/\partial x)\alpha - \alpha(\partial/\partial x) = Ax^n$, where $A$ is a
constant, is $\alpha = A(n + 1)^{-1}x^{n+1} + c$.

² The operator $\frac{d}{dx}x - x\frac{d}{dx}$ is an example of a constant in disguise.

3. The Operator $-d^2/dx^2$

We shall be concerned for the present with real and complex
functions $u(x)$ of the real variable $x$, and shall usually write
$u$, $u'$, $u''$, and so forth, for $u(x)$, $du(x)/dx$, $d^2u(x)/dx^2$, and so
forth. The moduli (or absolute values, see §8) of $x$ and $u(x)$
will be denoted, respectively, by $|x|$ and $|u|$; since our $x$ is
real, $|x|$ equals $x$ when $x$ is positive, and $-x$ when $x$ is negative.
The statement that a function of $x$ remains finite when $x \to \infty$
will mean that when $x$ increases without limit this function does
not tend to $\infty$ (as does, say, $e^x$) or to $-\infty$ (as does, say, $-e^x$),
and does not oscillate with amplitude approaching $\infty$ (as do,
say, $x\sin x$ and $-x\cos^2 x$); the meaning of the statement that
a function of $x$ remains finite when $x \to -\infty$ is similar. The
words "when $|x| \to \infty$" will mean "when $x \to \infty$ and also when
$x \to -\infty$."

If $u(x)$ and $du(x)/dx$ are both continuous for all real values of $x$,
we say that $u(x)$ satisfies our standard continuity conditions.

If $u(x)$ [or $|u|$, if $u$ is complex or pure imaginary] remains
finite when $|x| \to \infty$, we say that $u(x)$ satisfies our standard
boundary conditions.³ If $u(x)$

(a) satisfies our standard continuity conditions,

(b) satisfies our standard boundary conditions, and

(c) is not identically zero,

we call $u(x)$ a well-behaved function. For example, the four functions
shown in the top row of Fig. 1 are well-behaved, while the
four functions in the bottom row are ill-behaved, that is, not
well-behaved.

[Fig. 1. Top row: examples of well-behaved functions; bottom row:
examples of ill-behaved functions. Legible panel labels: $u = x$; $u = \sin|x|$.]

³ It should perhaps be emphasized that to satisfy our boundary conditions
a function must remain finite when $x \to \infty$ and also when $x \to -\infty$.

The problem of immediate interest to us is to consider the
conditions under which certain second-order differential equations
possess well-behaved solutions. Some differential equations
have a well-behaved general solution; for example, the general
solution of

$$u'' = -u \tag{1}$$

is

$$u = A\sin x + B\cos x, \tag{2}$$

and satisfies our standard boundary and continuity conditions for
any choice of the arbitrary constants⁴ $A$ and $B$, so that (2) is
well behaved for any choice of $A$ and $B$ except $A = B = 0$.
Some differential equations do not have any well-behaved solutions
at all; for example, the general solution of

$$u'' = u \tag{3}$$

is

$$u = Ae^{x} + Be^{-x}; \tag{4}$$

the first term in (4) increases without limit when $x \to \infty$, and the
second does so when $x \to -\infty$; hence no adjustment of $A$ and $B$
(except $A = B = 0$) yields a solution satisfying our boundary
conditions.

Finally, a differential equation may not possess well-behaved
solutions in general, but does possess them if a numerical constant
appearing in the equation is suitably chosen. This case will be
of special interest to us, and we shall illustrate it by several
examples, the first of which will be to determine the values of the
numerical constant $\lambda$ for which the equation

$$u'' = -\lambda u, \tag{5}$$

of which (1) and (3) are special cases, possesses well-behaved
solutions, and to find these solutions.

⁴ In anticipation of working with several independent variables, note
that the statement that a quantity involved in a mathematical expression
is a constant means only that this quantity does not depend on the
independent variable or variables explicitly appearing in the expression; for
example, the statement that $A$ in (2) is a constant means that $A$ does not
depend on $x$, but does not exclude the possibility that $A$ depends on
independent variables other than $x$.

Now, if X 7 ^ 0, the general solution of (6) is 

u = Ai cos X* a: As sin X* x, (6) 

where the A’s are arbitrary constants, and where X* denotes 
-t-A/X' The solution (6) satisfies the continuity conditions re- 
gardless of the values of the A’s, and it remains to be seen if the 
boundary conditions are also fulfilled; in doing this we shall con- 
sider several cases separately. 

First, let

$$\lambda > 0, \tag{7}$$

so that $\lambda^{1/2}$ is a nonvanishing positive number; (6) then remains
finite everywhere, and hence the boundary conditions are satisfied.
Next, let

$$\lambda < 0, \tag{8}$$

so that $\lambda^{1/2}$ is a nonvanishing pure imaginary number, say $ia$,
where $a$ is positive. It is convenient in this case to write the general
solution of (5) as

$$u = B_1e^{ax} + B_2e^{-ax} \tag{9}$$

rather than as (6), and it is seen that if $\lambda < 0$, then neither the
general solution of (5) nor any of its particular solutions are
well-behaved.

We shall leave it to the reader to show that if $\lambda$ is a complex
or a pure imaginary number, then (5) does not possess any
well-behaved solutions, and we shall proceed to the remaining case,

$$\lambda = 0. \tag{10}$$

The general solution of (5) is now

$$u = Bx + C, \tag{11}$$

where $B$ and $C$ are arbitrary constants. The term $Bx$ does not
satisfy our boundary conditions, while the term $C$ does; hence
the special solution

$$u = C, \tag{12}$$

obtained by setting $B = 0$ in (11), satisfies all our conditions,
and consequently (5) does possess well-behaved solutions when
$\lambda = 0$. Note that the results in the two cases $\lambda > 0$ and $\lambda = 0$
differ in an important respect. In the case $\lambda > 0$ the general
solution itself is well-behaved; while in the case $\lambda = 0$ the general
solution is not, but the particular solution (12), containing a
single arbitrary constant, is. Note also that, although (6) is not
the general solution of (5) when $\lambda = 0$, the substitution $\lambda = 0$
into (6) yields, nevertheless, the solution (12), that is, the solution
satisfying the boundary conditions when $\lambda = 0$.

To summarize: Equation (5) possesses well-behaved solutions
if, and only if,

$$\lambda \ge 0; \tag{13}$$

and the well-behaved solutions have the form (6), or the equivalent
form

$$u = Ae^{i\lambda^{1/2}x} + Be^{-i\lambda^{1/2}x}, \tag{14}$$

which for most of our purposes is more convenient than (6); it is
to be understood that the trivial case $A = B = 0$ in (14) is to be
excluded.
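The summary above can be spot-checked numerically. The following Python sketch is an illustration added for the reader, not part of the original argument: it approximates $-u''$ by a central difference, confirms that the form (14) satisfies (5) for one sample $\lambda > 0$ and stays bounded, and shows that the form (9) grows without limit. The function names, the step size `h`, and the sample constants are assumptions chosen only for this example.

```python
import cmath
import math

def minus_second_derivative(f, x, h=1e-3):
    """Central-difference approximation of -f''(x)."""
    return -(f(x + h) - 2 * f(x) + f(x - h)) / (h * h)

def solution_14(lam, A, B):
    """u = A exp(i sqrt(lam) x) + B exp(-i sqrt(lam) x), form (14), lam >= 0."""
    k = math.sqrt(lam)
    return lambda x: A * cmath.exp(1j * k * x) + B * cmath.exp(-1j * k * x)

def solution_9(a, B1, B2):
    """u = B1 exp(a x) + B2 exp(-a x), form (9), valid for lam = -a**2 < 0."""
    return lambda x: B1 * math.exp(a * x) + B2 * math.exp(-a * x)

lam = 4.0
u = solution_14(lam, A=1.0, B=0.5)

# Residual of -u'' = lam * u at a few sample points; should be close to zero.
residual = max(abs(minus_second_derivative(u, x) - lam * u(x))
               for x in (-3.0, 0.0, 2.5))

# For real x, |u| can never exceed |A| + |B|, so (14) remains finite.
bounded = max(abs(u(x)) for x in range(-1000, 1001, 50))

# lam = -1 reproduces equation (3); its solutions increase without limit.
v = solution_9(a=1.0, B1=1.0, B2=1.0)
growth = [v(x) for x in (5.0, 10.0, 20.0)]
```

A finite-difference check of this kind only samples a few points, so it illustrates the conclusion rather than proving it.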

Eigenvalues and eigenfunctions. To simplify further discussion,
we introduce the following terminology. If, for the
specific value $\lambda_i$ of the numerical constant $\lambda$, a well-behaved
operand $u_i$ satisfies the equation

$$\alpha u = \lambda u, \tag{15}$$

where $\alpha$ is a given operator (that is, if

$$\alpha u_i = \lambda_i u_i, \tag{16}$$

and $u_i$ is well-behaved), we say that the number $\lambda_i$ is an
eigenvalue of the operator $\alpha$, that the operand $u_i$ is an eigenfunction
of $\alpha$, and that the eigenvalue $\lambda_i$ and the eigenfunction $u_i$ of the
operator $\alpha$ belong to each other.

For example, since the operand $\sin 3x$ is well-behaved, and since

$$-\frac{d^2}{dx^2}\sin 3x = 9\sin 3x, \tag{17}$$

we say that the number 9 is an eigenvalue of the operator $-d^2/dx^2$, that
the operand $\sin 3x$ is an eigenfunction of the operator $-d^2/dx^2$, and that
the eigenvalue 9 and the eigenfunction $\sin 3x$ of the operator $-d^2/dx^2$
belong to each other.

The term eigenvalue is half-translated (to use Professor Kemble's expression)
from the German Eigenwert; alternative terms are proper value (or number)
and characteristic value (or number).

Alternative terms for eigenfunction are proper function and characteristic
function.


Our definitions can be restated as follows: an eigenfunction
of the operator $\alpha$ is a well-behaved operand such that operating
on it with $\alpha$ is equivalent to multiplying it by a numerical
constant, this constant then being the corresponding eigenvalue of $\alpha$.
If

$$\alpha = -\frac{d^2}{dx^2}, \tag{18}$$

Equation (15) goes over into (5), and consequently the problem
concerning (5) treated above (our first example of the eigenvalue
problem) can be stated as follows: to find the eigenvalues and the
eigenfunctions of the operator $-d^2/dx^2$. And our conclusions can
be stated thus: any positive number, zero included, is an eigenvalue
of the operator $-d^2/dx^2$, and the eigenfunction of $-d^2/dx^2$
belonging to the eigenvalue $\lambda$ is (14).

We call the totality of the eigenvalues of an operator its eigenvalue
spectrum, or simply its spectrum; thus we say that the
spectrum of the operator $-d^2/dx^2$ consists of all positive numbers,
zero included.


The eigenvalue problem, which plays a fundamental part in the quantum 
mechanics of particle motion, arises also quite frequently in classical 
theoretical physics, particularly in the theory of continuous media; but 
the definition of good behavior of a function is then usually somewhat dif- 
ferent from that adopted above, and the problem usually goes over into 
what is called the Sturm-Liouville problem.

Exercises 

1. Show that the function $\cos 4x$ is an eigenfunction of the operator
$d^2/dx^2$ belonging to the eigenvalue $-16$. Show that the function $xe^{-x^2/2}$
is an eigenfunction of the operator $-d^2/dx^2 + x^2$ belonging to the
eigenvalue 3. Show that the function $\sinh 2x$ is not an eigenfunction of the
operator $d^2/dx^2$ in spite of the equation $(d^2/dx^2)\sinh 2x = 4\sinh 2x$.

2. Show that the operator $-d^2/dx^2$ has no complex or pure imaginary
eigenvalues.

3. Show that, if $u_i$ is an eigenfunction of a linear operator $\alpha$ belonging
to the eigenvalue $\lambda_i$, then $Cu_i$, where $C$ is an arbitrary nonvanishing
constant, is also an eigenfunction of $\alpha$ belonging to the eigenvalue $\lambda_i$.

4. The operator $-\partial^2/\partial x^2$ is of interest when the operands are explicitly
permitted to be functions not only of $x$ but also of other independent
variables. Show that, if the operands are required to be well-behaved so
far as their dependence on $x$ is concerned, then the spectrum of $-\partial^2/\partial x^2$
is the same as that of $-d^2/dx^2$ found in the text, and that the eigenfunctions
of $-\partial^2/\partial x^2$ have the form (14) and can depend on independent variables
other than $x$ only through the coefficients $A$ and $B$.

4. The Operator $-\dfrac{d^2}{dx^2} + x^2$

Our next problem is to determine the eigenvalues and the
eigenfunctions of the operator

$$-\frac{d^2}{dx^2} + x^2, \tag{1}$$

that is, to find the values of the numerical constant $\lambda$ for which
the equation $\alpha u = \lambda u$ possesses well-behaved solutions when $\alpha$
is the operator (1), and to find these solutions. The operator (1)
will come into play in connection with the first dynamical system
that we shall study from the quantum-mechanical standpoint,
namely, the linear harmonic oscillator.

Remarks on curvature. If a curve $u = u(x)$ is concave downward wherever
it lies above the $x$-axis, and concave upward wherever it lies below
the $x$-axis, we say that it is concave toward the $x$-axis in the region in
question; for example, the function $\sin x$ is concave toward the $x$-axis
throughout the infinite region. Now, according to a theorem in elementary
calculus, $u$ is concave downward if $u''$ is negative, and concave upward
if $u''$ is positive. Hence $u$ is concave toward the $x$-axis if $u$ and $u''$ have
opposite algebraic signs; in other words,

$$u \text{ is concave toward the } x\text{-axis if } u''/u < 0. \tag{2a}$$

If a curve is concave upward wherever it lies above the $x$-axis, and
concave downward wherever it lies below the $x$-axis, we say that it is
convex toward the $x$-axis in the region in question; for example, the
function $x^3$ is convex toward the $x$-axis throughout the infinite region. An
argument similar to that used above shows that

$$u \text{ is convex toward the } x\text{-axis if } u''/u > 0. \tag{2b}$$

The results (2a) and (2b) will find immediate application in our study
of the operator (1).

When $\alpha$ is the particular operator (1), the eigenvalue equation
$\alpha u = \lambda u$ becomes the differential equation

$$-\frac{d^2u}{dx^2} + x^2u = \lambda u. \tag{3}$$

The general solution of (3) cannot be expressed in any simple way
in terms of the elementary functions; consequently, we shall have
to proceed in a manner less direct than that of §3, and a few
preliminary remarks are perhaps in order.

Since, as the reader will verify,

$$\left(-\frac{d^2}{dx^2} + x^2\right)e^{-x^2/2} = e^{-x^2/2}, \tag{4}$$

and the function $e^{-x^2/2}$ (Fig. 3) is well-behaved, the operator (1) has the
eigenvalue 1. Similarly, since

$$\left(-\frac{d^2}{dx^2} + x^2\right)xe^{-x^2/2} = 3xe^{-x^2/2}, \tag{5}$$

and the function $xe^{-x^2/2}$ (Fig. 4) is well-behaved, the operator (1) has also
the eigenvalue 3. The question may now be asked: Does the operator (1)
have the eigenvalue 2? To answer it we must find whether the equation

$$-\frac{d^2u}{dx^2} + x^2u = 2u \tag{6}$$

has well-behaved solutions.

To get an idea of the behavior of the solutions of (6), we calculate by
numerical integration two particular solutions $U_0$ and $U_1$ of (6)
satisfying the following conditions at $x = 0$:

$$U_0(0) = 1, \quad U_0'(0) = 0; \qquad U_1(0) = 0, \quad U_1'(0) = 1. \tag{7}$$

Graphs of $U_0$ and $U_1$ are shown in the lower part of Fig. 2; in the upper
part are plotted the parabola $x^2$ and the constant $\lambda = 2$. The graphs


(In this footnote, as well as in some other places, we write $\exp s$ for $e^s$.)
When $x \to \pm\infty$ the first factor of $x\exp(-\tfrac12x^2)$ approaches $\pm\infty$ while the
second approaches zero, so that the product, as it stands, becomes
indeterminate. To resolve the indeterminacy, we rewrite $x\exp(-\tfrac12x^2)$ as
$x/\exp(\tfrac12x^2)$ and recall (see indeterminate forms in a calculus text) that,
if the ratio of two functions of $x$ becomes indeterminate for some value
of $x$, then it can be replaced at this value of $x$ by the ratio of the
derivatives of the numerator and the denominator; thus, the limit of
$x/\exp(\tfrac12x^2)$ when $x \to \infty$ or $x \to -\infty$ may be replaced by the corresponding
limit of $[dx/dx]/[d\exp(\tfrac12x^2)/dx]$, that is, of $1/[x\exp(\tfrac12x^2)]$, and we
conclude that $x\exp(-\tfrac12x^2) \to 0$ when $|x| \to \infty$. Similar arguments show
that the product of $\exp(-\tfrac12x^2)$ and of any polynomial in $x$ (that is, any
terminating power series in $x$) approaches zero when $|x| \to \infty$.

Even when the general solution of an ordinary differential equation
cannot be obtained analytically, particular solutions satisfying prescribed 
numerical initial conditions can usually be evaluated by numerical meth- 
ods; for references, see §33. 





suggest that $U_0$ is even in $x$, while $U_1$ is odd in $x$; that this is indeed so
will be shown rigorously later. Also, we observe that in the region
$|x| < \sqrt2$ (where the parabola in the upper part of Fig. 2 lies below the
line $\lambda = 2$) both $U$'s are concave toward the $x$-axis, while in the regions
$|x| > \sqrt2$ (where the parabola lies above the line $\lambda = 2$) both $U$'s are
convex toward the $x$-axis. That the curvature of any solution of (6) has
these properties can be shown, without reference to graphs, as follows:
For $u \neq 0$, Equation (6) can be written as $u''/u = x^2 - 2$, and therefore

$$u''/u < 0 \quad \text{for } |x| < \sqrt2, \tag{8}$$

and

$$u''/u > 0 \quad \text{for } |x| > \sqrt2, \tag{9}$$

so that, according to (2a), $u$ is concave toward the $x$-axis when $|x| < \sqrt2$,
and, according to (2b), $u$ is convex toward the $x$-axis when $|x| > \sqrt2$.

Now, at $x = 2$, say, the value of $U_1$ is seen from the graph to be
positive; in view of (9), $U_1''$ will remain positive for all values of $x$
greater than 2 as long as $U_1$ remains positive, so that, since both $U_1$ and
its slope are positive at $x = 2$, $U_1$ is certain to increase without limit
when $x \to \infty$. We find in a similar way that $U_1$ decreases without limit
when $x \to -\infty$, and that $U_0$ decreases without limit when $|x| \to \infty$. In
brief, both $U_0$ and $U_1$ are ill-behaved.

Fig. 2. Two solutions of (6); curvature exaggerated.

The general solution of (6) is

$$u = AU_0 + BU_1, \tag{10}$$

where $A$ and $B$ are arbitrary constants. In view of the behavior of $U_0$
and $U_1$ discussed above, we now conclude that, although $A$ and $B$ can
perhaps be so chosen that (10) remains finite when $x \to \infty$ or when
$x \to -\infty$, it is certainly impossible to choose $A$ and $B$ so that (10)
remains finite when $x \to \infty$ and also when $x \to -\infty$, if the trivial
case $A = B = 0$ is excluded. Consequently, (6) does not possess any
well-behaved solutions, and hence the operator (1) does not have the
eigenvalue 2; it will presently be shown, in fact, that the only eigenvalues
of (1) are the odd positive integers.
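The numerical integration described above can be imitated with a few lines of Python. This is a minimal sketch under stated assumptions, not the procedure used for the book's figures: it integrates $u'' = (x^2 - \lambda)u$ from $x = 0$ with the classical fourth-order Runge-Kutta rule, using the initial conditions of the even and odd particular solutions. The function name `integrate`, the step count, and the end points are choices made only for this illustration.

```python
import math

def integrate(lam, u0, du0, x_end, steps=4000):
    """Integrate u'' = (x**2 - lam) * u from x = 0 to x_end with classical RK4.

    The second-order equation is treated as a first-order system in (u, p),
    where p = u'. Returns the value u(x_end).
    """
    h = x_end / steps
    x, u, p = 0.0, u0, du0

    def deriv(x, u, p):
        return p, (x * x - lam) * u

    for _ in range(steps):
        k1u, k1p = deriv(x, u, p)
        k2u, k2p = deriv(x + h / 2, u + h / 2 * k1u, p + h / 2 * k1p)
        k3u, k3p = deriv(x + h / 2, u + h / 2 * k2u, p + h / 2 * k2p)
        k4u, k4p = deriv(x + h, u + h * k3u, p + h * k3p)
        u += h / 6 * (k1u + 2 * k2u + 2 * k3u + k4u)
        p += h / 6 * (k1p + 2 * k2p + 2 * k3p + k4p)
        x += h
    return u

# lam = 2: the even solution (u(0) = 1, u'(0) = 0) and the odd solution
# (u(0) = 0, u'(0) = 1) both leave any finite bound as x grows.
U0_at_5 = integrate(2.0, 1.0, 0.0, 5.0)
U1_at_5 = integrate(2.0, 0.0, 1.0, 5.0)

# lam = 1: the even solution should follow exp(-x**2 / 2).
u_at_3 = integrate(1.0, 1.0, 0.0, 3.0)
exact_at_3 = math.exp(-4.5)
```

With these settings the even solution for $\lambda = 2$ comes out large and negative at $x = 5$ and the odd one large and positive, in line with the qualitative argument in the text. Integrating much farther for $\lambda = 1$ would eventually let rounding errors excite the ill-behaved solution, so the comparison is made at a moderate $x$.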


Let us return for a moment to (4), which shows that $e^{-x^2/2}$ is an
eigenfunction of (1) belonging to the eigenvalue 1. Now, the general solution
of the equation

$$-\frac{d^2u}{dx^2} + x^2u = u \tag{11}$$



The function $u(x)$ is called even in $x$, or simply even, if $u(x) = u(-x)$
for all values of $x$, and is called odd in $x$, or simply odd, if $u(x) = -u(-x)$
for all values of $x$; thus $\cos x$ is even in $x$, $\sin x$ is odd in $x$, and
$\cos(x + \tfrac14\pi)$ is neither even nor odd in $x$.





turns out to be

$$u = Ae^{-x^2/2} + BU_1, \tag{12}$$

where $U_1$, calculated numerically under the conditions $U_1(0) = 0$ and
$U_1'(0) = 1$, is shown in Fig. 3, and where $A$ and $B$ are arbitrary
constants; $U_1$ is odd in $x$ and is readily shown to be ill-behaved. Since
$e^{-x^2/2}$ is well-behaved and $U_1$ is not, the only way to get well-behaved
solutions of (11) is to set $B = 0$ in (12), thus leaving out the odd part of
the general solution.

Again, (5) shows that $xe^{-x^2/2}$ is an eigenfunction of (1) belonging to the
eigenvalue 3. The general solution of the equation

$$-\frac{d^2u}{dx^2} + x^2u = 3u \tag{13}$$

turns out to be

$$u = AU_0 + Bxe^{-x^2/2}, \tag{14}$$

where $U_0$, calculated numerically under the conditions $U_0(0) = 1$ and
$U_0'(0) = 0$, is shown in Fig. 4, and where $A$ and $B$ are arbitrary constants;
$U_0$ is even in $x$ and is ill-behaved. Since $xe^{-x^2/2}$ is well-behaved and
$U_0$ is not, the only way to get well-behaved solutions of (13) is
to set $A = 0$ in (14), thus leaving out the even part of the general solution.

The principal features of the solutions of (3) that will be proved below
and that have been illustrated to a certain extent by the examples given
above are:

(a) For any value of $\lambda$, the particular solutions $U_0$ and $U_1$ of (3)
satisfying the conditions (7) are, respectively, even in $x$ and odd in $x$,
and the general solution of (3) has the form

$$u = AU_0 + BU_1, \tag{15}$$

where $A$ and $B$ are arbitrary constants.

(b) Unless $\lambda$ is an odd positive integer, both $U_0$ and $U_1$ are
ill-behaved, and $u$ is ill-behaved for every choice of $A$ and $B$ in (15).

(c) If $\lambda$ is an odd positive integer, then either $U_0$ is well-behaved and
$U_1$ is not, or $U_1$ is well-behaved and $U_0$ is not; well-behaved solutions
of (3) can in this case be obtained by equating to zero the coefficient of the
ill-behaved term in (15), and consequently the eigenvalues of (1) are the
odd positive integers.



Fig. 3. Two solutions of (11); curvature exaggerated. Here $U_0 = e^{-x^2/2}$.

Fig. 4. Two solutions of (13); curvature exaggerated. Here $U_1 = xe^{-x^2/2}$.




A convenient procedure for a systematic identification of the
eigenvalues of the operator (1) is the following: We let

$$u = ve^{-x^2/2}, \tag{16}$$

where $v = v(x)$, and find by substituting (16) into (3) that, in
order that (16) be a solution of (3), $v$ must satisfy the equation

$$v'' - 2xv' + (\lambda - 1)v = 0, \tag{17}$$

where the primes denote differentiation with respect to $x$. The
boundary conditions to be imposed on $v$ are less stringent than our
standard conditions imposed on $u$; $v$ may be allowed to become
infinite when $|x| \to \infty$, provided that the product $ve^{-x^2/2}$ remains
finite when $|x| \to \infty$.

We now seek the general solution of (17) expressed as a power
series:

$$v = a_rx^r + a_{r+1}x^{r+1} + \cdots, \quad \text{with } a_r \neq 0, \tag{18}$$

that is,

$$v = \sum_i a_ix^i, \qquad i = r, r+1, r+2, \ldots, \tag{19}$$

with $a_r \neq 0$. To identify this series, we must adjust the $a$'s so
that when (18) is substituted into the left side of (17) the coefficient
of each power of $x$ in the resulting expression vanishes.
We expect two arbitrary constants to appear in the solution,
since (17) is a differential equation of the second order. Substituting
(18) into (17), we get

$$\sum_i \left[a_i i(i-1)x^{i-2} + a_i(\lambda - 1 - 2i)x^i\right] = 0, \tag{20}$$

and note that terms in $x^k$ in (20) come from two sources: from the
first term in the brackets when $i = k + 2$, and from the second
when $i = k$. Hence the complete coefficient of $x^k$ in (20) is

$$(k+2)(k+1)a_{k+2} + (\lambda - 1 - 2k)a_k. \tag{21}$$


The substitution (16) is suggested by the fact that, for an arbitrary $m$,
the function $x^me^{-x^2/2}$ is, for any value of $\lambda$, an approximate solution of (3)
in the regions where $|x|$ is sufficiently large. The method for handling (3)
that we use here is called the Sommerfeld polynomial method; see A. Sommerfeld,
Wave-Mechanics (hereafter cited as Sommerfeld), page 14, London,
Methuen and Co. (1930).




In order that (20) hold, the coefficient (21) must vanish for every
value of $k$, so that the relation

$$\frac{a_{k+2}}{a_k} = \frac{2k + 1 - \lambda}{(k+1)(k+2)} \tag{22}$$

must hold for every value of $k$. This relation enables us to calculate
the coefficients $a_{k+2}$, $a_{k+4}$, $a_{k+6}$, and so forth, if the coefficient
$a_k$ is available. Note that (22) involves alternate rather
than consecutive coefficients of (18).

To determine the powers of $x$ with which the series (18) may
start, we observe that the complete coefficient of the lowest power
of $x$, that is, of $x^{r-2}$, in (20), is $a_rr(r-1)$, so that the equation

$$r(r - 1) = 0 \tag{23}$$

must hold, and we have the two possibilities $r = 0$ and $r = 1$.
The series, which we denote by $V_0$, obtained from (18) by letting
$r = 0$, letting $a_0 = 1$, and omitting the odd powers of $x$, that is,

$$V_0(x) = 1 + a_2x^2 + a_4x^4 + \cdots, \tag{24}$$

is a particular solution of (17), provided the $a$'s satisfy (22).
Similarly, the series, which we denote by $V_1$, obtained from (18)
by letting $r = 1$, letting $a_1 = 1$, and omitting the even powers
of $x$, that is,

$$V_1(x) = x + a_3x^3 + a_5x^5 + \cdots, \tag{25}$$

is also a particular solution of (17) if the $a$'s satisfy (22). The
general solution of (17) can thus be written as

$$v = AV_0(x) + BV_1(x), \tag{26}$$

where $A$ and $B$ are arbitrary constants.

If the series (24) happens to terminate, that is, if $V_0$ is a polynomial,
then the product $V_0e^{-x^2/2}$ approaches zero when $|x| \to \infty$,
and the first term of (26) contributes a well-behaved term to $u$
when substituted into (16). But if (24) is an infinite series, then
the product $V_0e^{-x^2/2}$ is ill-behaved, as the following argument
would indicate. For large values of $k$, (22) goes over into the
approximate equation $a_{k+2}/a_k = 2/k$, so that for large values of $k$
the ratio of the coefficients of $x^{k+2}$ and $x^k$ in $V_0$ approaches the
ratio of the coefficients of $x^{k+2}$ and $x^k$ in the power series for the
function $e^{x^2}$; thus $V_0$ then behaves for large values of $x$ somewhat
like $e^{x^2}$, and $V_0e^{-x^2/2}$ behaves for large values of $x$ somewhat like
the ill-behaved function $e^{x^2/2}$. To summarize: $V_0$ contributes a
well-behaved term to the solution (16) of (3) if, and only if, $V_0$
is a polynomial, that is, a terminating series. Similar considerations
show that $V_1$ contributes a well-behaved term to the solution
of (3) if, and only if, $V_1$ is a polynomial.

Now, inspection of (22) shows, first, that the only condition
under which one of the two series in (26) will terminate is that,
for some $k$,

$$2k + 1 - \lambda = 0, \tag{27}$$

in which case $a_{k+2} = a_{k+4} = a_{k+6} = \cdots = 0$, and, second, that
it is impossible to choose $\lambda$ so as to make both series in (26)
terminate. Thus the only way to obtain well-behaved solutions of
(3) is

(a) to let $\lambda = 2n + 1$, where $n$ is a positive integer or zero (thus
causing $V_0$ to terminate if $n$ is even, or $V_1$ if $n$ is odd), and

(b) to set the arbitrary multiplier of the nonterminating series
in (26) equal to zero.

Consequently, the only eigenvalues of the operator (1) are the odd
positive integers 1, 3, 5, ...; we shall label them $\lambda_0, \lambda_1, \lambda_2, \ldots$,
according to the scheme

$$\lambda_n = 2n + 1, \qquad n = 0, 1, 2, 3, \ldots, \tag{28}$$

and shall denote the eigenfunction belonging to $\lambda_n$ by $u_n$. This
eigenfunction contains an arbitrary nonvanishing numerical
factor.

The eigenfunction of (1) belonging to a particular eigenvalue
can be obtained by computing the appropriate solution of (17)
and substituting the result into (16).

To illustrate, we shall find the eigenfunction $u_5$ belonging to $\lambda_5 = 11$.
Since $n$ is odd, the terminating series in (26) is $V_1$, and the coefficient $A$
in (26) is to be set equal to zero. Equation (22) gives for $\lambda = 11$

$$\frac{a_3}{a_1} = -\frac{4}{3}, \qquad \frac{a_5}{a_3} = -\frac{1}{5}, \qquad \frac{a_7}{a_5} = 0; \tag{29}$$

consequently,

$$a_3 = -\frac{4}{3}a_1, \qquad a_5 = \frac{4}{15}a_1, \qquad a_7 = a_9 = a_{11} = \cdots = 0, \tag{30}$$

and the required solution of (17) is

$$v = a_1\left(x - \frac{4}{3}x^3 + \frac{4}{15}x^5\right). \tag{31}$$

This may be written as

$$v = A_5(32x^5 - 160x^3 + 120x), \tag{32}$$

where $A_5$ is the arbitrary constant, and where the coefficient of $x^5$ within
the parentheses is made equal to $2^5$ in order that $u_5$ may appear in the
form that we shall later adopt as standard. Substitution of (32) into (16)
finally yields

$$u_5 = A_5(32x^5 - 160x^3 + 120x)e^{-x^2/2}. \tag{33}$$
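The arithmetic of this worked example can be reproduced mechanically. The sketch below is offered as an illustration only; the function name `series_coefficients` and the cutoff `max_power` are assumptions introduced here. It applies the ratio $a_{k+2}/a_k = (2k + 1 - \lambda)/[(k+1)(k+2)]$ of relation (22) with exact rational arithmetic, starting from $a_0 = 1$ or $a_1 = 1$, and stops as soon as a coefficient vanishes.

```python
from fractions import Fraction

def series_coefficients(lam, start, max_power=40):
    """Coefficients {power: a_power} of V0 (start=0) or V1 (start=1).

    Relation (22): a_{k+2} / a_k = (2k + 1 - lam) / ((k + 1)(k + 2)),
    with the leading coefficient set equal to 1. Generation stops at
    max_power, or earlier if the series terminates.
    """
    coeffs = {start: Fraction(1)}
    k = start
    while k + 2 <= max_power:
        nxt = coeffs[k] * Fraction(2 * k + 1 - lam, (k + 1) * (k + 2))
        if nxt == 0:
            break  # condition (27): all later coefficients vanish as well
        coeffs[k + 2] = nxt
        k += 2
    return coeffs

v1 = series_coefficients(11, start=1)  # lam = 11: the odd series terminates
v0 = series_coefficients(11, start=0)  # the even series does not terminate

# Choosing a_1 = 120 gives the integer coefficients 120, -160, 32.
scaled = {power: 120 * a for power, a in v1.items()}
```

Here `v1` holds the coefficients $1$, $-4/3$, $4/15$ of $x$, $x^3$, $x^5$, while `v0` keeps growing until the artificial cutoff, which is the reason its multiplier must be set equal to zero.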

Hermite polynomials. We shall now verify that the polynomial
solution of (17) involved in $u_n$ can, apart from an arbitrary
multiplicative constant, be written in the form

$$e^{x^2}\frac{d^n}{dx^n}e^{-x^2}; \tag{34}$$

note that (34) is a polynomial, since $d^ne^{-x^2}/dx^n$ is $e^{-x^2}$ times a
polynomial. When $\lambda = \lambda_n = 2n + 1$, Equation (17) becomes

$$v'' - 2xv' + 2nv = 0, \tag{35}$$

and substitution of (34) into this yields

$$\frac{d^{n+2}}{dx^{n+2}}e^{-x^2} + 2x\frac{d^{n+1}}{dx^{n+1}}e^{-x^2} + 2(n+1)\frac{d^n}{dx^n}e^{-x^2} = 0, \tag{36}$$

so that we shall have verified that (34) is a solution of (35) if we
show that (36) holds for $n = 0, 1, 2, 3, \ldots$. Now, if we differentiate
(36) with respect to $x$, we get the same equation as that
obtained by writing $n + 1$ for $n$ in (36); therefore, if (36) holds
for one value of $n$, it holds also for the next-higher value of $n$;
but (36) is seen to hold for $n = 0$; hence it holds also for $n = 1$,
for $n = 2$, and so forth, that is, for all positive integral values
of $n$, and our proof is completed.

The product of (34) and the constant $(-1)^n$ is called the
Hermite polynomial of the $n$th degree and is denoted by $H_n(x)$:

$$H_n(x) = (-1)^ne^{x^2}\frac{d^n}{dx^n}e^{-x^2}; \tag{37}$$


C. Hermite, Comptes Rendus, 68, 93 (1864).

A proof of this type is called a proof by mathematical induction.





thus

$$H_0(x) = 1, \qquad H_1(x) = 2x, \tag{38a, \ldots}$$
$$H_2(x) = 4x^2 - 2, \qquad H_3(x) = 8x^3 - 12x,$$
$$H_4(x) = 16x^4 - 48x^2 + 12, \qquad H_5(x) = 32x^5 - 160x^3 + 120x,$$

and so forth.
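The list above can be regenerated directly from the definition of $H_n(x)$ as $(-1)^n e^{x^2}\, d^n e^{-x^2}/dx^n$. The following sketch is an illustration, not the book's procedure: it represents a polynomial by its list of coefficients in ascending powers and uses the elementary fact that $d[q(x)e^{-x^2}]/dx = (q' - 2xq)e^{-x^2}$. The helper names are assumptions made for this example.

```python
def derivative(p):
    """Derivative of a polynomial given as a coefficient list [c0, c1, c2, ...]."""
    return [k * c for k, c in enumerate(p)][1:] or [0]

def times_x(p):
    """Multiply a polynomial by x."""
    return [0] + p

def subtract(p, q):
    """p - q for coefficient lists of possibly different lengths."""
    n = max(len(p), len(q))
    p = p + [0] * (n - len(p))
    q = q + [0] * (n - len(q))
    return [a - b for a, b in zip(p, q)]

def hermite(n):
    """Coefficient list of H_n = (-1)^n exp(x^2) d^n/dx^n exp(-x^2).

    Since d/dx [q exp(-x^2)] = (q' - 2 x q) exp(-x^2), the polynomial
    factor q_n of the n-th derivative obeys q_{n+1} = q_n' - 2 x q_n
    with q_0 = 1.
    """
    q = [1]
    for _ in range(n):
        q = subtract(derivative(q), [2 * c for c in times_x(q)])
    return [(-1) ** n * c for c in q]

table = [hermite(n) for n in range(6)]
```

Each entry of `table` lists the coefficients of $1, x, x^2, \ldots$; for instance the entry for $n = 5$ corresponds to $120x - 160x^3 + 32x^5$.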

The eigenfunction of the operator (1) belonging to the eigenvalue
$\lambda_n = 2n + 1$ can now be written as

$$u_n = A_nH_n(x)e^{-x^2/2}, \tag{39}$$

where $A_n$ is an arbitrary constant. The function (39) is called
the $n$th Hermite function. Graphs of Hermite functions are
shown in Fig. 5; the choice of $A_n$ made in this figure will be
discussed in §8.



Fig. 5. Six Hermite functions (39), $n = 0$ through $n = 5$, with
$A_n = (2^nn!\,\pi^{1/2})^{-1/2}$. The horizontal and vertical scale units are
approximately 1 and .1, respectively. The thick horizontal lines extend
along the $x$-axes from $-\sqrt{2n+1}$ to $\sqrt{2n+1}$.

The graphs are drawn after those in L. Pauling and E. B. Wilson's
Introduction to Quantum Mechanics (hereafter cited as Pauling and Wilson);
New York, McGraw-Hill Book Co., 1935.





To establish certain properties of the Hermite polynomials, we consider
the auxiliary function

$$z(x, s) = e^{x^2 - (s - x)^2} \tag{40}$$

of two independent variables $x$ and $s$. The $n$th partial derivative of
$e^{-(s-x)^2}$ with respect to $s$, evaluated at $s = 0$, satisfies the relation

$$\left[\frac{\partial^n}{\partial s^n}e^{-(s-x)^2}\right]_{s=0} = (-1)^n\frac{d^n}{dx^n}e^{-x^2}. \tag{41}$$

[Equation (41) can be proved, for example, by mathematical induction:
we verify it for $n = 0$ and $n = 1$ and show, with the help of (36), that,
if it holds for $n = n'$ and for $n = n' + 1$, then it holds also for $n = n' + 2$.]
Therefore,

$$\left[\frac{\partial^n}{\partial s^n}z(x, s)\right]_{s=0} = H_n(x). \tag{42}$$

Next we expand $z(x, s)$ into a Taylor (Maclaurin) series in $s$. The
coefficient of $s^n$ in the series is the $n$th partial derivative of $z(x, s)$ taken
with respect to $s$, evaluated at $s = 0$, and divided by $n!$. In view of (42), we
then have

$$z(x, s) = \sum_n \frac{s^n}{n!}H_n(x), \qquad n = 0, 1, 2, \ldots. \tag{43}$$

Thus the function (40), when expanded in powers of $s$, gives rise, so to
speak, to the totality of the Hermite polynomials; for this reason it is
called the generating function of these polynomials.
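The expansion of the generating function can be checked numerically at a sample point. The sketch below is an added illustration: it evaluates the polynomials by the three-term relation $H_{n+1} = 2xH_n - 2nH_{n-1}$ (the recursion formula (49) of this section), sums the series $\sum_n s^nH_n(x)/n!$ up to an assumed cutoff, and compares the result with $e^{x^2 - (s-x)^2}$. The function names, the cutoff `n_max`, and the sample values of $x$ and $s$ are choices made only for this example.

```python
import math

def hermite_values(x, n_max):
    """Values H_0(x), ..., H_n_max(x) from H_{n+1} = 2x H_n - 2n H_{n-1}."""
    values = [1.0, 2.0 * x]
    for n in range(1, n_max):
        values.append(2.0 * x * values[n] - 2.0 * n * values[n - 1])
    return values[: n_max + 1]

def generating_series(x, s, n_max=60):
    """Partial sum over n = 0 .. n_max of s**n / n! * H_n(x)."""
    h = hermite_values(x, n_max)
    return sum(s ** n / math.factorial(n) * h[n] for n in range(n_max + 1))

x, s = 0.7, 0.4
closed_form = math.exp(x * x - (s - x) ** 2)  # the auxiliary function z(x, s)
series_sum = generating_series(x, s)
```

For moderate $|s|$ the terms fall off quickly, so a cutoff of a few dozen terms is ample; the two numbers should then agree to within rounding error.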

Now, our $z(x, s)$ satisfies the equation

$$\frac{\partial}{\partial x}z(x, s) = 2sz(x, s), \tag{44}$$

so that, in view of (43), the relation

$$\sum_n \frac{s^n}{n!}H_n'(x) = 2\sum_n \frac{s^{n+1}}{n!}H_n(x), \tag{45}$$

where the prime denotes differentiation with respect to $x$, holds identically
in $s$. Equating the coefficients of $s^n$ on both sides of (45), we find that

$$H_n'(x) = 2nH_{n-1}(x). \tag{46}$$

The function $z(x, s)$ also satisfies the equation

$$\frac{\partial}{\partial s}z(x, s) = -2sz(x, s) + 2xz(x, s), \tag{47}$$

and consequently

$$\sum_n \frac{s^{n-1}}{(n-1)!}H_n(x) + 2\sum_n \frac{s^{n+1}}{n!}H_n(x) - 2x\sum_n \frac{s^n}{n!}H_n(x) = 0. \tag{48}$$

Equating to zero the coefficient of $s^n$, we get the relation

$$H_{n+1}(x) - 2xH_n(x) + 2nH_{n-1}(x) = 0, \tag{49}$$

called the recursion formula for the Hermite polynomials.

The differential equation satisfied by $H_n(x)$ is (35):

$$H_n''(x) - 2xH_n'(x) + 2nH_n(x) = 0. \tag{50}$$

The properties of the Hermite polynomials established here are (43), (46),
(49), and (50); familiarity with their proofs is not essential for the present.

$H_{-1}(x)$ should be interpreted as zero when (46) or (49) is used with
$n = 0$.
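The recursion formula lends itself to a short computation. The sketch below, an illustration with hypothetical helper names, builds the coefficient lists of $H_0, \ldots, H_6$ from $H_{n+1} = 2xH_n - 2nH_{n-1}$ and then tests, at one integer sample point, the derivative rule $H_n' = 2nH_{n-1}$ and the differential equation $H_n'' - 2xH_n' + 2nH_n = 0$. Integer arithmetic keeps every step exact.

```python
def hermite_table(n_max):
    """Coefficient lists [c0, c1, ...] of H_0 .. H_n_max from recursion (49)."""
    table = [[1], [0, 2]]
    for n in range(1, n_max):
        two_x_hn = [0] + [2 * c for c in table[n]]       # 2x H_n
        prev = [2 * n * c for c in table[n - 1]]          # 2n H_{n-1}
        prev += [0] * (len(two_x_hn) - len(prev))
        table.append([a - b for a, b in zip(two_x_hn, prev)])
    return table[: n_max + 1]

def derivative(p):
    """Derivative of a coefficient list; the zero polynomial is returned as [0]."""
    return [k * c for k, c in enumerate(p)][1:] or [0]

def evaluate(p, x):
    """Horner evaluation of a coefficient list at x."""
    result = 0
    for c in reversed(p):
        result = result * x + c
    return result

H = hermite_table(6)
n, x = 5, 3  # an arbitrary sample degree and sample point

# Left side minus right side of (46); should vanish.
check_46 = evaluate(derivative(H[n]), x) - 2 * n * evaluate(H[n - 1], x)

# Left side of (50); should vanish.
check_50 = (evaluate(derivative(derivative(H[n])), x)
            - 2 * x * evaluate(derivative(H[n]), x)
            + 2 * n * evaluate(H[n], x))
```

A check at a single point does not prove the identities, but a nonzero value would immediately expose an error in the table.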

Exercises 


1. Calculate $H_6(x)$ in three ways: by the method used in the text for
getting the $H_5(x)$ involved in (32); using (37); and using (49) and (38).

2. Show that (50) follows from (46) and (49). 

3. Verify that $H_{k+1}(x)e^{-x^2} = -(d/dx)[H_k(x)e^{-x^2}]$; then show, using
Rolle's theorem and mathematical induction, that the $n$ nodes of $H_n(x)$
[that is, the roots of the equation $H_n(x) = 0$] are all real and distinct.

4. Show that every solution of (3) inflects at each of its nodes, at
$x = -\lambda^{1/2}$, and at $x = \lambda^{1/2}$; that every solution of (3) is concave toward the
$x$-axis in the region $-\lambda^{1/2} < x < \lambda^{1/2}$, and is convex toward the $x$-axis outside
this region; that no solution of (3) can have more than one node in the
region $x < -\lambda^{1/2}$, or more than one node in the region $x > \lambda^{1/2}$; and that, if a
solution of (3) has a node at a finite value of $x$ in the region $x < -\lambda^{1/2}$, or
at a finite value of $x$ in the region $x > \lambda^{1/2}$, then this solution is ill-behaved.
Show further that (39) has $n$ nodes in the region $-\sqrt{2n+1} < x < \sqrt{2n+1}$,
one node at $x = -\infty$, and another at $x = \infty$, that is, $n + 2$ nodes in all.

5. Show that the functions (39) satisfy the identities

$$xu_n = \frac{nA_n}{A_{n-1}}u_{n-1} + \frac{A_n}{2A_{n+1}}u_{n+1} \tag{51}$$

and

$$\frac{d}{dx}u_n = \frac{nA_n}{A_{n-1}}u_{n-1} - \frac{A_n}{2A_{n+1}}u_{n+1}. \tag{52}$$

Then deduce directly from (51) and (52) that

$$x^2u_n = \frac{n(n-1)A_n}{A_{n-2}}u_{n-2} + \tfrac12(2n+1)u_n + \frac{A_n}{4A_{n+2}}u_{n+2}, \tag{53}$$

$$\frac{d^2}{dx^2}u_n = \frac{n(n-1)A_n}{A_{n-2}}u_{n-2} - \tfrac12(2n+1)u_n + \frac{A_n}{4A_{n+2}}u_{n+2}, \tag{54}$$

and

$$\left(x + \frac{d}{dx}\right)^ku_n = \frac{2^kn!\,A_n}{(n-k)!\,A_{n-k}}u_{n-k} \qquad \text{for } k \le n. \tag{55}$$

Rolle's theorem states that, if $u(x)$ satisfies our standard continuity
conditions and vanishes at $x = a$ and at $x = b$, then there is at least one
point within the interval $a < x < b$ at which $u'(x) = 0$.


6. By a $k$-fold application of (53), the function $x^{2k}u_n$ can be expressed as

$$x^{2k}u_n = \cdots + c_{n-4}u_{n-4} + c_{n-2}u_{n-2} + c_nu_n + c_{n+2}u_{n+2} + c_{n+4}u_{n+4} + \cdots. \tag{56}$$

Show, without evaluating the $c$'s, but by comparing the structures of
(53) and (54), that

$$(-1)^k\frac{d^{2k}}{dx^{2k}}u_n = \cdots + c_{n-4}u_{n-4} - c_{n-2}u_{n-2} + c_nu_n - c_{n+2}u_{n+2} + c_{n+4}u_{n+4} - \cdots, \tag{57}$$

where the $c$'s are, respectively, the same as those in (56).

7. Show that, if the operands are required to be well-behaved so far
as their dependence on $x$ is concerned, then the spectrum of $-\partial^2/\partial x^2 + x^2$
is the same as that of $-d^2/dx^2 + x^2$ found in the text, and that the
eigenfunctions of $-\partial^2/\partial x^2 + x^2$ have the form (39) and can depend on
independent variables other than $x$ only through the coefficient $A_n$.


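5. The Operators $-i\,\partial/\partial x$, $x$, and $c$

An operator of fundamental importance in quantum mechanics is

$$-i\frac{\partial}{\partial x}, \tag{1}$$

where $i^2 = -1$. In the case of this operator, the eigenvalue
equation $15^{3}$ (see the footnote on cross references), that is,

$$\alpha u = \lambda u, \tag{2}$$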

In cross references, we specify the section number by a superscript
(a notation patterned after that of E. U. Condon and G. H. Shortley).
For example, the symbol $15^{3}$ denotes Equation (15) of §3; similarly,
"Exercise $7^{44}$" denotes Exercise 7 of §44. Section numbers are printed at
the right of page headings, so that the number in the upper right of a
reference symbol is the number in the upper right of a page. Note that
the section number heading a page on which one section ends and another
begins usually refers to the former section. References to the Appendix
carry the superscript A.

The figures and the tables are numbered consecutively throughout the
book; but to help in locating them we sometimes append the section number
as a superscript. For example, the superscript in "Fig. $22^{15}$" means that
Fig. 22 appears in §15.





is

$$-i\frac{\partial}{\partial x}u = \lambda u, \tag{3}$$

and its general solution is

$$u = Ae^{i\lambda x}, \tag{4}$$

where $A$ is independent of $x$, but is otherwise arbitrary. This
solution is well-behaved if $\lambda$ is any real number (positive, negative,
or zero), but is ill-behaved if $\lambda$ is a nonvanishing complex
or a pure imaginary number. The spectrum of $-i\,\partial/\partial x$ thus
consists of all real numbers, and the eigenfunctions of $-i\,\partial/\partial x$
belonging to the eigenvalue $\lambda$ are (4). This spectrum differs from the
continuous spectrum of $-d^2/dx^2$ found in §3 in that it contains all
real numbers, negative as well as positive.
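The distinction between real and complex values of $\lambda$ can be made concrete with a short computation. In the sketch below (an added illustration; the function names, the step `h`, and the sample values are assumptions), the function $Ae^{i\lambda x}$ is differentiated numerically to confirm that it satisfies $-i\,\partial u/\partial x = \lambda u$ for both a real and a complex $\lambda$, and its modulus is sampled far from the origin to show that only the real value gives a bounded function.

```python
import cmath

def eigenfunction(lam, A=1.0):
    """u(x) = A exp(i lam x), the general solution of -i du/dx = lam u."""
    return lambda x: A * cmath.exp(1j * lam * x)

def apply_minus_i_d_dx(u, x, h=1e-5):
    """Central-difference approximation of -i du/dx at x."""
    return -1j * (u(x + h) - u(x - h)) / (2 * h)

real_case = eigenfunction(3.0)
complex_case = eigenfunction(3.0 + 0.5j)

# The differential equation is satisfied in both cases ...
res_real = abs(apply_minus_i_d_dx(real_case, 0.8) - 3.0 * real_case(0.8))
res_complex = abs(apply_minus_i_d_dx(complex_case, 0.8)
                  - (3.0 + 0.5j) * complex_case(0.8))

# ... but only the real value of lam keeps the modulus bounded.
mod_real = [abs(real_case(x)) for x in (-40.0, 0.0, 40.0)]
mod_complex = [abs(complex_case(x)) for x in (-40.0, 0.0, 40.0)]
```

For $\lambda = 3 + 0.5i$ the modulus equals $e^{-0.5x}$, which increases without limit when $x \to -\infty$; this is the sense in which the solution is ill-behaved.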

To determine the eigenvalues and eigenfunctions of an operator
of type $c$, we set $\alpha = c$ in (2), and obtain

$$cu = \lambda u. \tag{5}$$

In view of the definition of the operator $c$, this equation is satisfied
if, and only if, $\lambda = c$, when the trivial case $u = 0$ is excluded.
The operator $c$ thus has only one eigenvalue, the number $c$.
When $\lambda = c$, any $u$ (in particular, any well-behaved $u$) satisfies
(5), so that any well-behaved function is an eigenfunction of the
operator $c$.

To find the eigenvalues and eigenfunctions of the operator $x$,
we set $\alpha = x$ in (2) and consider the equation

$$xu = \lambda u. \tag{6}$$

Since $u$ is not permitted to be identically zero, and since $x$ is a
variable, while $\lambda$ is a constant, we conclude that $u$ must vanish
for all values of $x$ except the value $x = \lambda$, for which it must not
vanish. A function of this kind will be discussed at the end of
§12, where, with the help of a nonrigorous argument, we shall
conclude that every real number (positive, negative, or zero),
and no other number, is an eigenvalue of the operator $x$. The
operators $x$ and $-i\,\partial/\partial x$ thus have spectra of the same type.


6. Simultaneous Eigenfunctions

The well-behaved function

$$e^{3ix} \tag{1}$$

is an eigenfunction of the operator $-i\,\partial/\partial x$ belonging to the
eigenvalue 3 of $-i\,\partial/\partial x$, since

$$-i\frac{\partial}{\partial x}e^{3ix} = 3e^{3ix}; \tag{2}$$

the function (1) is also an eigenfunction of the operator $-\partial^2/\partial x^2$
(belonging to the eigenvalue 9 of $-\partial^2/\partial x^2$), since

$$-\frac{\partial^2}{\partial x^2}e^{3ix} = 9e^{3ix}. \tag{3}$$

An operand that is an eigenfunction of two or more operators is
said to be a simultaneous eigenfunction of these operators; for
example, (1) was shown above to be a simultaneous eigenfunction
of $-i\,\partial/\partial x$ and $-\partial^2/\partial x^2$.

An eigenfunction $u$ of an operator $\alpha$ belonging to the eigenvalue
$\lambda$ of $\alpha$ is simultaneously an eigenfunction of the operator $\alpha^2$
belonging to the eigenvalue $\lambda^2$ of $\alpha^2$; indeed, if

$$\alpha u = \lambda u, \tag{4}$$

then $\alpha^2u = \alpha(\alpha u) = \alpha\lambda u = \lambda\alpha u = \lambda^2u$, that is,

$$\alpha^2u = \lambda^2u. \tag{5}$$

We can thus conclude, for example, that, since $-\partial^2/\partial x^2 = (-i\,\partial/\partial x)^2$,
an eigenfunction of $-i\,\partial/\partial x$ belonging to the eigenvalue $\lambda$ of
$-i\,\partial/\partial x$ is simultaneously an eigenfunction of $-\partial^2/\partial x^2$
belonging to the eigenvalue $\lambda^2$ of $-\partial^2/\partial x^2$; Equations (2) and (3)
illustrate this result for the case of the function (1).

The converse of this theorem is in general not true: an eigenfunction
of $\alpha^2$ need not be simultaneously an eigenfunction of $\alpha$.
To illustrate: the general eigenfunction of the operator $-\partial^2/\partial x^2$
belonging to the eigenvalue $\lambda$, that is,

$$(14^{3}) \qquad u = Ae^{i\lambda^{1/2}x} + Be^{-i\lambda^{1/2}x}, \tag{6}$$

is, in general, not an eigenfunction of the operator $-i\,\partial/\partial x$, since

$$-i\frac{\partial}{\partial x}u = \lambda^{1/2}\left(Ae^{i\lambda^{1/2}x} - Be^{-i\lambda^{1/2}x}\right), \tag{7}$$

The marginal number at the left of an equation is its old number, or
the number of an equation which, for one reason or another, might
advantageously be recalled.





so that 

^i — u 7 ^ constant X u, (8) 

bx 

unless -4, or X is zero. 

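Certain pairs of operators (for example, the operators $-i\,\partial/\partial x$ and $-\partial^2/\partial x^2 + x^2$) do not have any eigenfunctions in common.

If an operand u is a simultaneous eigenfunction of two operators $\alpha$ and $\beta$, so that

$$\alpha u = \lambda u, \qquad \beta u = \mu u, \tag{9}$$

where $\lambda$ and $\mu$ are numerical constants, then u is also an eigenfunction of the operator $\alpha\beta - \beta\alpha$, and belongs to the eigenvalue 0 of $\alpha\beta - \beta\alpha$. Indeed, in view of (9),

$$\alpha\beta u = \alpha\mu u = \mu\alpha u = \mu\lambda u, \tag{10}$$

and

$$\beta\alpha u = \beta\lambda u = \lambda\beta u = \lambda\mu u, \tag{11}$$

so that

$$(\alpha\beta - \beta\alpha)u = 0; \tag{12}$$

Equation (12) can be written as

$$(\alpha\beta - \beta\alpha)u = 0 \cdot u, \tag{13}$$

and the theorem is proved.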
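The statements (2) and (3) can also be checked numerically. The following is a minimal sketch, not part of the original text: it assumes central-difference approximations to the derivatives (the helper names `d1`, `d2`, and `eigenvalue_estimates` are illustrative), and it forms the ratio (operator applied to u)/u at a few sample points. For an eigenfunction this ratio is the same constant everywhere.

```python
import cmath

def u(x):
    # The function (1): u(x) = exp(3ix).
    return cmath.exp(3j * x)

def d1(f, x, h=1e-5):
    # Central-difference estimate of df/dx (an assumed numerical scheme).
    return (f(x + h) - f(x - h)) / (2 * h)

def d2(f, x, h=1e-4):
    # Central-difference estimate of d^2 f/dx^2.
    return (f(x + h) - 2 * f(x) + f(x - h)) / (h * h)

def eigenvalue_estimates(x):
    # Ratio (operator u)/u at the point x for the two operators.
    lam_first = -1j * d1(u, x) / u(x)   # operator -i d/dx
    lam_second = -d2(u, x) / u(x)       # operator -d^2/dx^2
    return lam_first, lam_second

for x in (-1.0, 0.3, 2.0):
    lam_first, lam_second = eigenvalue_estimates(x)
    print(x, lam_first, lam_second)
```

Up to discretization error, the two ratios should stay close to 3 and 9 at every sample point, the second being the square of the first, as (5) requires.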



Exercises 


1. Show that an eigenfunction of $-i\,\partial/\partial x$ belonging to the eigenvalue $\lambda$ of $-i\,\partial/\partial x$ is an eigenfunction of $i\,\partial^3/\partial x^3$ belonging to the eigenvalue $\lambda^3$ of $i\,\partial^3/\partial x^3$.

2. Show that, if u is an eigenfunction of $\alpha$ belonging to the eigenvalue $\lambda$, then u is also an eigenfunction of $c\alpha$ belonging to the eigenvalue $c\lambda$.

3. Show that, if u is an eigenfunction of $\alpha$ belonging to the eigenvalue a, and $F(\alpha)$ is a function of the operator $\alpha$, then u is an eigenfunction of $F(\alpha)$ belonging to the eigenvalue $F(a)$.

4. Show by means of an example that a well-behaved operand u may satisfy the equation $(\alpha\beta - \beta\alpha)u = 0$ without being an eigenfunction of either $\alpha$ or $\beta$.

6. iMp^t the commutator of * and -id/dx, and show (without ref- 

SS.S'iT.Z""*”'”"""’ “ >-• .ton. 

6. Show that, if $\alpha$ and $\beta$ commute, and if u is an eigenfunction of $\alpha$ belonging to the eigenvalue $\lambda$, then the operand $\beta u$, if well-behaved, is also an eigenfunction of $\alpha$ belonging to the eigenvalue $\lambda$.





7. Let a, jS, and y be operators satisfying the equations — — 7» 

“ 7/3 = —Of, and 70: ~ 07 - — Show that, if u is a simultaneous 

eigenfunction of any two of these operators, then it is also an eigenfunction 
of the third, and the eigenvalues of $\alpha$, $\beta$, and $\gamma$ belonging to u are all zero.

8. Show that, if $\alpha$, $\beta$, and $\gamma$ satisfy the commutation rules of Exercise 7,

then each of these operators commutes with 7®* 

7. Degenerate Eigenvalues

It was shown in Exercise 3^ that, if $u_i$ is an eigenfunction of a linear operator $\alpha$ belonging to the eigenvalue $\lambda_i$ of $\alpha$, then $Cu_i$, where C is an arbitrary constant, is also an eigenfunction of $\alpha$ belonging to the same eigenvalue, so that to every eigenvalue of a linear operator $\alpha$ there belongs an infinite set of eigenfunctions. In this connection, we must distinguish two cases: (a) when every eigenfunction of $\alpha$ belonging to $\lambda_i$ can be obtained from any other such eigenfunction by multiplying the latter by a constant (we then call $\lambda_i$ a nondegenerate eigenvalue of $\alpha$); and (b) when not all eigenfunctions of $\alpha$ belonging to $\lambda_i$ have the form $Cu_i$, where $u_i$ is one of these eigenfunctions (we then call $\lambda_i$ a degenerate eigenvalue of $\alpha$). To illustrate: the eigenvalue 0 of the operator $-d^2/dx^2$ is nondegenerate, because, as shown in §3, all eigenfunctions of $-d^2/dx^2$ belonging to this eigenvalue are constants, so that every one of them can be obtained from any other by multiplying the latter by a constant; on the other hand, every eigenvalue of $-d^2/dx^2$ greater than 0 is degenerate, because, when $\lambda > 0$, this operator has such eigenfunctions as

$$u = e^{i\sqrt{\lambda}\,x} \tag{1}$$

and

$$u = e^{-i\sqrt{\lambda}\,x}, \tag{2}$$

which cannot be converted into each other by a multiplication by a constant.

A set of n functions, $f_1, f_2, \ldots, f_n$, is said to be linearly independent if it is impossible to choose n constants $c_1, c_2, \ldots, c_n$, not all of which are zeros, in such a way that the equation

$$c_1f_1 + c_2f_2 + \cdots + c_nf_n = 0 \tag{3}$$

holds for all values of the arguments of the f's; otherwise the f's are said to be linearly dependent. The multiplicity, or the degree of degeneracy, of an eigenvalue $\lambda_i$ of an operator is now defined as follows: if the aggregate of the eigenfunctions belonging to $\lambda_i$ contains n, and not more than n, linearly independent eigenfunctions (that is, if the explicit general expression for the eigenfunctions belonging to $\lambda_i$ contains just n essentially arbitrary constants), then the degeneracy of $\lambda_i$ is said to be n-fold (except that $\lambda_i$ is said to be nondegenerate when n = 1). For example, the degeneracy of each eigenvalue $\lambda > 0$ of $-d^2/dx^2$ is twofold, since the functions (1) and (2) are linearly independent, and since, when $\lambda > 0$, Equation 5^ possesses no solutions that are not linear combinations of (1) and (2).
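The definition built on (3) lends itself to a quick numerical probe. The sketch below is an illustration under stated assumptions, not a method given in the text: it samples the functions at a few arbitrarily chosen points and computes the rank of the resulting table with a small hand-written Gaussian elimination (the names `rank`, `sampled_rank`, and `POINTS` are hypothetical). A rank equal to the number of functions proves linear independence, since a nontrivial relation (3) would have to hold at the sample points too; a smaller rank only suggests dependence.

```python
import math

def rank(rows, tol=1e-9):
    # Rank of a small matrix by Gaussian elimination with partial pivoting.
    m = [list(r) for r in rows]
    nrows, ncols = len(m), len(m[0])
    rk = 0
    for col in range(ncols):
        if rk == nrows:
            break
        pivot = max(range(rk, nrows), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < tol:
            continue  # no usable pivot in this column
        m[rk], m[pivot] = m[pivot], m[rk]
        for r in range(rk + 1, nrows):
            factor = m[r][col] / m[rk][col]
            for c in range(col, ncols):
                m[r][c] -= factor * m[rk][c]
        rk += 1
    return rk

def sampled_rank(functions, points):
    # One row per sample point, one column per function.
    return rank([[f(x) for f in functions] for x in points])

POINTS = [0.1, 0.7, 1.3, 2.1, 2.9]  # arbitrary sample points (an assumption)
dependent = [lambda x: 1.0, lambda x: math.cos(x) ** 2, lambda x: math.sin(x) ** 2]
independent = [lambda x: 1.0, math.cos, math.sin]
print(sampled_rank(dependent, POINTS), sampled_rank(independent, POINTS))
```

The first set is tied together by the relation $1 - \cos^2 x - \sin^2 x = 0$, which is why its sampled rank falls short of three.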

Exercises 

1. Show that the three functions 1, $\cos^2 x$, and $\sin^2 x$ are linearly dependent, and that the three functions 1, $\cos x$, and $\sin x$ are linearly independent.

2. Show that none of the three operators $-i\,\partial/\partial x$, x, and $-\partial^2/\partial x^2 + x^2$ has any degenerate eigenvalues. Show that the single eigenvalue of the operator c is degenerate to an infinite degree.

3. Show that, if $\alpha$ and $\beta$ commute, if u is an eigenfunction of $\alpha$ belonging to a nondegenerate eigenvalue of $\alpha$, and if $\beta u$ is well-behaved, then u is also an eigenfunction of $\beta$.

4. Show that if $\lambda$ is real the function $e^{i\lambda x}$ is an eigenfunction of every differential operator that commutes with $-i\,\partial/\partial x$.

8. Orthogonal Sets of Functions

The eigenfunctions of some of the operators of quantum mechanics form so-called discrete orthogonal sets, and we shall now make a few remarks concerning such sets. We shall deal, in general, with complex functions of real variables, and shall employ the following terminology and notation. If $f_1$ and $f_2$ are real numbers or real functions of real variables, and

$$f = f_1 + if_2, \tag{1}$$

where $i^2 = -1$, we call $f_1$ the real part of f, call $f_2$ the imaginary part of f, and write

$$f_1 = \operatorname{Re} f, \quad\text{and}\quad f_2 = \operatorname{Im} f; \tag{2}$$

for example, since $e^{ix} = \cos x + i\sin x$, the real part of $e^{ix}$ is $\cos x$, the imaginary part of $e^{ix}$ is $\sin x$, and we write $\operatorname{Re} e^{ix} = \cos x$, and $\operatorname{Im} e^{ix} = \sin x$. The function $f_1 - if_2$, which we denote by $\bar{f}$, is called the complex conjugate of f; for example, if $f = e^{ix}$, then $\bar{f} = e^{-ix}$. The complex conjugate of a given function is obtained from the function itself by reversing the algebraic sign of i wherever i appears in the given function; it is unnecessary first to rewrite the function in the form (1). The function $+\sqrt{f_1^2 + f_2^2}$, which we denote by $|f|$, is called the modulus, or the absolute value, of f; for example, if $f = e^{ix}$, then $|f| = 1$. The principal value of arc tan $(f_2/f_1)$ is called the phase of f. The product of f and its complex conjugate, that is, the square of the modulus of f, will be denoted by $\bar{f}f$ or by $|f|^2$. If $\operatorname{Im} f = 0$, so that f is real, the complex conjugate of f is f itself, and $\bar{f}f = f^2$. The product of a function g and the complex conjugate $\bar{f}$ of a function f will be written in the form $\bar{f}g$ rather than in the equivalent form $g\bar{f}$; the convenience of this convention will become apparent when we come to quantum mechanics.

If the product of a function $u_1(x)$ and of the complex conjugate $\bar{u}_2(x)$ of a function $u_2(x)$ vanishes when integrated with respect to x over the interval $a < x < b$, that is, if

$$\int_a^b \bar{u}_2(x)\,u_1(x)\,dx = 0, \tag{3}$$

then $u_1(x)$ and $u_2(x)$ are said to be mutually orthogonal,¹⁹ or, simply, orthogonal in the interval (a, b).

If every two functions contained in the set

$$u_1(x),\; u_2(x),\; u_3(x),\; u_4(x),\; \ldots \tag{4}$$

are mutually orthogonal in the interval (a, b) of x, that is, if

$$\int_a^b \bar{u}_m(x)\,u_n(x)\,dx = 0 \quad\text{when } n \neq m \text{ and } n, m = 1, 2, 3, \ldots, \tag{5}$$

then the set (4) is said to be orthogonal in the interval (a, b) of x. For example, the infinite set

$$\begin{aligned} 1,\; &\cos x,\; \cos 2x,\; \cos 3x,\; \ldots \\ &\sin x,\; \sin 2x,\; \sin 3x,\; \ldots \end{aligned} \tag{6}$$


is orthogonal in the interval $(-\pi, \pi)$ of x because the integral of the product of any member of (6) and the complex conjugate²⁰ of any other member of the set vanishes when the limits of integration are $-\pi$ and $\pi$; the set (6) is in fact orthogonal in any interval of x of length $2\pi$.

¹⁹ Some of the definitions given here differ from those usually employed in discussions restricted to real functions of real variables.

²⁰ Since every member of (6) is real, the words complex conjugate may be omitted in the present case.
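The orthogonality of the set (6), and its independence of where the interval of length $2\pi$ starts, can be checked numerically. The sketch below is only an illustration: it assumes a composite midpoint rule for the integrals and tests just the first few members of the set (the names `integrate`, `members`, and `max_offdiagonal_overlap` are hypothetical).

```python
import math

def integrate(g, a, b, n=2000):
    # Composite midpoint rule on n equal cells (an assumed quadrature).
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def members(kmax):
    # First members of the set (6): 1, cos kx, sin kx for k = 1..kmax.
    fs = [lambda x: 1.0]
    for k in range(1, kmax + 1):
        fs.append(lambda x, k=k: math.cos(k * x))
        fs.append(lambda x, k=k: math.sin(k * x))
    return fs

def max_offdiagonal_overlap(a, kmax=4):
    # Largest |integral of u_m u_n| over (a, a + 2*pi) for m != n.
    fs = members(kmax)
    b = a + 2 * math.pi
    worst = 0.0
    for i, f in enumerate(fs):
        for j, g in enumerate(fs):
            if i != j:
                val = integrate(lambda x: f(x) * g(x), a, b)
                worst = max(worst, abs(val))
    return worst

# The interval (-pi, pi), and a shifted interval of the same length.
print(max_offdiagonal_overlap(-math.pi), max_offdiagonal_overlap(0.4))
```

Both printed numbers should be zero to within rounding error; since every member of (6) is real, no complex conjugation is needed in the integrand.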

The set (6) is of great interest in mathematical physics because a large class of functions²¹ of x can be expressed in an interval of length $2\pi$ as linear combinations²² of members of (6), so that a large class of periodic functions having the period $2\pi$ can be expressed in this manner in the infinite interval $(-\infty, \infty)$. When a function is expressed as a linear combination of the members of (6), it is said to be expanded into its Fourier series. Certain Fourier series are encountered in elementary trigonometry; for example, the right side of the familiar identity

$$\sin^2 x = \tfrac12 - \tfrac12\cos 2x \tag{7}$$

is the Fourier series of the function $\sin^2 x$.

The Fourier expansion (that is, the Fourier series) of a given function, if it exists, can sometimes be found in several superficially distinct ways; for example, the expansion (7) can be carried out by trigonometric or by equally elementary algebraic methods. But a general method exists for finding the Fourier expansion of a given function by means of integration; this method applies equally well to orthogonal sets other than (6), and we shall now present it for the general case.

If it can be shown that in the interval (a, b) the function f(x) admits of an expansion

$$f = c_1u_1 + c_2u_2 + c_3u_3 + \cdots + c_ku_k + \cdots \tag{8}$$

in terms of the set

$$u_1,\; u_2,\; u_3,\; \ldots, \tag{9}$$

which is orthogonal in the interval (a, b), then the expansion coefficients,²³ that is, the c's in (8), can be evaluated as follows:

1) write (8) with the c's undetermined;

2) multiply (8) through by $\bar{u}_k$, getting

$$\bar{u}_kf = c_1\bar{u}_ku_1 + c_2\bar{u}_ku_2 + c_3\bar{u}_ku_3 + \cdots + c_k\bar{u}_ku_k + \cdots; \tag{10}$$

3) integrate (10) term by term over the interval (a, b); in view of the orthogonality of the u's, all terms on the right side of (10) except that involving $\bar{u}_ku_k$ will vanish, and hence

$$\int_a^b \bar{u}_kf\,dx = c_k\int_a^b \bar{u}_ku_k\,dx, \tag{11}$$

so that

$$c_k = \frac{\displaystyle\int_a^b \bar{u}_kf\,dx}{\displaystyle\int_a^b \bar{u}_ku_k\,dx}, \qquad k = 1, 2, 3, \ldots. \tag{12}$$

²¹ Only highly discontinuous functions (not satisfying the Dirichlet conditions discussed in mathematical books) cannot be expanded into Fourier series in a finite interval.

²² By a linear combination of the functions $u_1, u_2, u_3, \ldots$, we mean an expression of the form $c_1u_1 + c_2u_2 + c_3u_3 + \cdots$, where the c's are constants.

²³ Expansion coefficients are sometimes called Fourier coefficients, even when sets other than (6) are concerned.


Note that (12) was derived on the assumptions that f(x) does admit of an expansion of type (8), and that a term-by-term integration of the right side of (10) is permissible; hence the fact that the c's given by (12) can be computed for a given f(x) and a given orthogonal set (9) does not guarantee that the expansion (8) exists; and therefore the use of the method of integration for the evaluation of expansion coefficients should be accompanied by an investigation of the question whether the series so obtained converges to the values of f(x) for the value of x under consideration. If a computation presumes the convergence of series or integrals without proving the validity of this presumption for the case on hand, we call it a formal computation. Incidentally, the fact that the c's are given by (12) if the expansion (8) exists shows that the expansion of a given function in terms of a given orthogonal set, if it exists, is unique.
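As a formal computation in the sense just described, the formula (12) can be evaluated numerically for the function $\sin^2 x$ and a few members of the set (6). This is a sketch under stated assumptions: the integrals are approximated by a composite midpoint rule, and the names `integrate` and `coefficient` are illustrative. Because the members of (6) are real, the complex conjugation in (12) is omitted.

```python
import math

def integrate(g, a, b, n=2000):
    # Composite midpoint rule on n equal cells (an assumed quadrature).
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def coefficient(u_k, f, a, b):
    # Formula (12) for a real set: c_k = (int u_k f dx) / (int u_k u_k dx).
    num = integrate(lambda x: u_k(x) * f(x), a, b)
    den = integrate(lambda x: u_k(x) * u_k(x), a, b)
    return num / den

f = lambda x: math.sin(x) ** 2
a, b = -math.pi, math.pi

c_const = coefficient(lambda x: 1.0, f, a, b)           # member 1
c_cos1 = coefficient(math.cos, f, a, b)                 # member cos x
c_cos2 = coefficient(lambda x: math.cos(2 * x), f, a, b)  # member cos 2x
c_sin2 = coefficient(lambda x: math.sin(2 * x), f, a, b)  # member sin 2x
print(c_const, c_cos1, c_cos2, c_sin2)
```

The coefficients of 1 and of $\cos 2x$ should come out close to $\tfrac12$ and $-\tfrac12$, in agreement with (7), and the others close to zero.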

In the case of some of the more important orthogonal sets, general conditions have been set up that enable us to determine, prior to a formal evaluation of the expansion coefficients, whether or not the expansion is possible; for example, it is known that every continuous periodic function of period $2\pi$ can be expanded in terms of the set (6). If every function of a certain type admits of an expansion in terms of a given orthogonal set, the set is said to be complete with respect to functions of this type; for example, the set (6) is complete with respect to continuous periodic functions of period $2\pi$.



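To illustrate the use of (12), let us obtain the Fourier series of the function f(x), shown in Fig. 6 and defined as follows:

$$f(x) = \begin{cases} -\dfrac{2}{\pi}\,(x + \pi) & \text{for } -\pi \le x \le -\tfrac12\pi, \\[4pt] \dfrac{2}{\pi}\,x & \text{for } -\tfrac12\pi \le x \le \tfrac12\pi, \\[4pt] -\dfrac{2}{\pi}\,(x - \pi) & \text{for } \tfrac12\pi \le x \le \pi, \end{cases} \tag{13a}$$

$$f(x + 2\pi) = f(x) \quad\text{for } -\infty < x < \infty. \tag{13b}$$

This function is continuous, and hence we are assured that the series obtained by a formal use of (6) and (12) will represent it. Writing

$$\begin{aligned} f(x) = a_0 &+ a_1\cos x + a_2\cos 2x + \cdots + a_n\cos nx + \cdots \\ &+ b_1\sin x + b_2\sin 2x + \cdots + b_n\sin nx + \cdots, \end{aligned} \tag{14}$$

we have, in view of (12):

$$a_n = \frac{\displaystyle\int_{-\pi}^{\pi}\cos nx\,f(x)\,dx}{\displaystyle\int_{-\pi}^{\pi}\cos^2 nx\,dx} = 0, \tag{15}$$

$$b_n = \frac{\displaystyle\int_{-\pi}^{\pi}\sin nx\,f(x)\,dx}{\displaystyle\int_{-\pi}^{\pi}\sin^2 nx\,dx} = \frac{8}{\pi^2n^2}\,\sin\tfrac12 n\pi, \tag{16}$$

the a's vanishing because f(x) is an odd function and the interval of integration is symmetric about x = 0. Consequently, the Fourier series of the function shown in Fig. 6 is

$$f(x) = \frac{8}{\pi^2}\left(\sin x - \frac{1}{9}\sin 3x + \frac{1}{25}\sin 5x - \frac{1}{49}\sin 7x + \cdots\right). \tag{17}$$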
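The coefficients behind (17) can be cross-checked by evaluating (12) numerically for the function (13a). The sketch below is illustrative only: it assumes a composite midpoint rule, and the helper names are hypothetical. The closed form $8\sin(\tfrac12 n\pi)/(\pi^2n^2)$ used for comparison is the one that reproduces the series (17) term by term.

```python
import math

def f(x):
    # The function (13a) on -pi <= x <= pi.
    if x <= -math.pi / 2:
        return -(2 / math.pi) * (x + math.pi)
    if x <= math.pi / 2:
        return (2 / math.pi) * x
    return -(2 / math.pi) * (x - math.pi)

def integrate(g, a, b, n=20000):
    # Composite midpoint rule on n equal cells (an assumed quadrature).
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def a_numeric(n):
    # Formula (12) with u = cos nx (n = 0 gives the constant member 1).
    num = integrate(lambda x: math.cos(n * x) * f(x), -math.pi, math.pi)
    den = integrate(lambda x: math.cos(n * x) ** 2, -math.pi, math.pi)
    return num / den

def b_numeric(n):
    # Formula (12) with u = sin nx.
    num = integrate(lambda x: math.sin(n * x) * f(x), -math.pi, math.pi)
    den = integrate(lambda x: math.sin(n * x) ** 2, -math.pi, math.pi)
    return num / den

def b_closed(n):
    # Closed form consistent with the series (17).
    return 8 * math.sin(n * math.pi / 2) / (math.pi ** 2 * n ** 2)

for n in range(1, 8):
    print(n, b_numeric(n), b_closed(n))
```

The two columns should agree to within the quadrature error, with the even-numbered coefficients vanishing and the odd-numbered ones alternating in sign.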



We shall now define certain additional terms. If the integral

$$\int_a^b \bar{u}u\,dx \tag{18}$$

is finite, the function u is said to be of integrable square, or quadratically integrable, in the interval (a, b).

If u and all of its derivatives approach zero when $|x| \to \infty$, and if $\int_{-\infty}^{\infty}\bar{u}u\,dx$ is finite, we call u a normalizable function (this use of the term is not standard in the literature); functions, such
as e”* sin (6® ), which are quadratically integrable, but whose 
derivatives do not vanish when $|x| \to \infty$, will not be of interest to us.

If

$$\int_a^b \bar{u}u\,dx = 1, \tag{19}$$

the function u is said to be normal, unitary, normalized to unity, or, simply, normalized, in the interval (a, b).

If every function in the set

$$u_1,\; u_2,\; u_3,\; \ldots \tag{20}$$

is normalized to unity in (a, b), the set (20) is said to be normal, unitary, normalized to unity, or, simply, normalized, in the interval (a, b).

If a set is both normal and orthogonal in an interval, it is said to be orthonormal in this interval. If the set (9) is orthonormal in (a, b), the formula (12) for the expansion coefficients in (8) becomes

$$c_k = \int_a^b \bar{u}_kf\,dx, \qquad k = 1, 2, 3, \ldots. \tag{21}$$

In speaking of quadratic integrability, orthogonality, and nor- 
malization, we usually omit a specific mention of the interval 
whenever it can be readily inferred from the context. 

If a function f, which is of integrable square in (a, b) and is not identically zero, is multiplied by a constant N such that

$$\bar{N}N = \left[\int_a^b \bar{f}f\,dx\right]^{-1}, \tag{22}$$

then the function Nf is normalized to unity in (a, b). A number N satisfying (22) is called a normalizing factor of f; note that (22), regarded as an equation to be solved for N, determines only the modulus of N and leaves N indeterminate to the extent of a phase factor $e^{i\gamma}$, where $\gamma$ is an arbitrary real constant, the phase of N. Consequently, the process of normalizing a given function does not lead to an unambiguous result unless the phase factor is specified.

By normalizing every function of a set, we may convert the set into a normalized set. To illustrate: the set (6), as it stands, is not normalized in the interval $(-\pi, \pi)$ because

$$\int_{-\pi}^{\pi}dx = 2\pi, \qquad \int_{-\pi}^{\pi}\cos^2 kx\,dx = \int_{-\pi}^{\pi}\sin^2 kx\,dx = \pi; \tag{23}$$

to normalize it, the function 1 must be multiplied by a constant of modulus $(2\pi)^{-1/2}$, and each of the other members of the set must be multiplied by a constant of modulus $\pi^{-1/2}$, so that the set

$$\begin{aligned} \frac{e^{i\gamma_0}}{\sqrt{2\pi}},\; &\frac{e^{i\gamma_1}}{\sqrt{\pi}}\cos x,\; \frac{e^{i\gamma_2}}{\sqrt{\pi}}\cos 2x,\; \frac{e^{i\gamma_3}}{\sqrt{\pi}}\cos 3x,\; \ldots, \\ &\frac{e^{i\delta_1}}{\sqrt{\pi}}\sin x,\; \frac{e^{i\delta_2}}{\sqrt{\pi}}\sin 2x,\; \frac{e^{i\delta_3}}{\sqrt{\pi}}\sin 3x,\; \ldots, \end{aligned} \tag{24}$$

is normal and orthogonal in $(-\pi, \pi)$, the $\gamma$'s and $\delta$'s being arbitrary real constants. Any particular choice of these arbitrary phases leads to a distinct orthonormal set, and, in order that a set be completely specified, each phase factor must be fixed by some suitable convention. For example, we may choose each $\gamma$ and $\delta$ in (24) equal to zero, and get the set

$$\begin{aligned} \frac{1}{\sqrt{2\pi}},\; &\frac{1}{\sqrt{\pi}}\cos x,\; \frac{1}{\sqrt{\pi}}\cos 2x,\; \frac{1}{\sqrt{\pi}}\cos 3x,\; \ldots \\ &\frac{1}{\sqrt{\pi}}\sin x,\; \frac{1}{\sqrt{\pi}}\sin 2x,\; \frac{1}{\sqrt{\pi}}\sin 3x,\; \ldots \end{aligned} \tag{25}$$

which is orthonormal in the interval $(-\pi, \pi)$ and is free from ambiguity.

If an orthogonal set is given, then other orthogonal sets may be constructed from it, the members of a new set being special linear combinations of the members of the original set. To illustrate: the two functions $e^{ix} = \cos x + i\sin x$ and $e^{-ix} = \cos x - i\sin x$ are mutually orthogonal in $(-\pi, \pi)$, and each of them is orthogonal to every member of (6) except $\cos x$ and $\sin x$. Therefore, the set

$$\begin{aligned} 1,\; &e^{ix},\; \cos 2x,\; \cos 3x,\; \cos 4x,\; \ldots \\ &e^{-ix},\; \sin 2x,\; \sin 3x,\; \sin 4x,\; \ldots \end{aligned} \tag{26}$$

is orthogonal in the interval $(-\pi, \pi)$. In a similar way, we may convert (6) into the set

$$\ldots,\; e^{-3ix},\; e^{-2ix},\; e^{-ix},\; 1,\; e^{ix},\; e^{2ix},\; e^{3ix},\; \ldots. \tag{27}$$

Normalizing (27) to unity in $(-\pi, \pi)$, and setting the phases of the normalizing factors equal to zero, we get the symmetric orthonormal set

$$\frac{1}{\sqrt{2\pi}}\,e^{inx}, \qquad n = \ldots, -2, -1, 0, 1, 2, \ldots, \tag{28}$$

which is in many respects more convenient than (25). We shall call an expansion in terms of (28) a Fourier series; no confusion need arise regarding the use of this term also in connection with expansions in terms of (6), because either expansion can be immediately converted into the other through the relations $e^{inx} = \cos nx + i\sin nx$, $\sin nx = -\tfrac12 ie^{inx} + \tfrac12 ie^{-inx}$, and $\cos nx = \tfrac12 e^{inx} + \tfrac12 e^{-inx}$.

Many of our equations will contain definite integrals, usually involving certain standard limits of integration. In the main one-dimensional equations, these limits will be indicated explicitly; but to simplify the printing of numerous auxiliary equations we shall let the symbol $\int f(s)\,đs$ (note the crossed d) stand for the definite integral of f(s) taken over the complete range of the variable s or, if s is an angle, over its principal range. For example, in the case of the variable x, whose range extends from $-\infty$ to $\infty$, we set

$$\int f(x)\,đx = \int_{-\infty}^{\infty} f(x)\,dx. \tag{29}$$





Exercises 


1 . Outline the procedure for deriving (7) by means of (12). 

2. Show that each of the two sets 



is orthonormal in any interval of length L. These sets are used for the expansion of periodic functions of period L, and the resulting series are called Fourier series, just as in the case of the sets (25) and (28).

3. Show that, if the function $\cos 137x$ is omitted from (6), then the remaining orthogonal set is no longer complete with respect to continuous periodic functions of period $2\pi$.

4. Show formally that if a function f, which is unitary in the interval (a, b), admits of the expansion $f = c_1u_1 + c_2u_2 + c_3u_3 + \cdots$ in terms of a set $u_1, u_2, u_3, \ldots$, which is orthonormal in the interval (a, b), then

$$\sum_k |c_k|^2 = 1, \qquad k = 1, 2, 3, \ldots. \tag{32}$$

5. Using repeated integration by parts and the formulas 37^ and 46^, show that the Hermite functions 39^ form an orthogonal set in the infinite interval $(-\infty, \infty)$, and that normalization of this set yields


$$u_n = \frac{e^{i\gamma_n}}{\sqrt{2^n\,n!\,\sqrt{\pi}}}\,H_n(x)\,e^{-x^2/2}, \qquad n = 0, 1, 2, 3, \ldots, \tag{33}$$


where each $\gamma_n$ is an arbitrary real constant. Graphs of the first six members of (33), with the $\gamma$'s set equal to zero, are given in Fig. 5.

The Hermite functions form an orthogonal set that is complete with respect to normalizable functions satisfying certain mild continuity conditions. An expansion in terms of the Hermite functions is called a Gram-Charlier series.

Formula for indefinite integration by parts (primes denote differentiation with respect to x): $\int uv'\,dx = uv - \int u'v\,dx$; for definite integration by parts: $\int_a^b uv'\,dx = uv\,\big|_a^b - \int_a^b u'v\,dx$, where $uv\,\big|_a^b = u(b)v(b) - u(a)v(a)$.

R. Courant and D. Hilbert, Methoden der Mathematischen Physik, vol. I, Second Edition, page 81. Berlin: Springer, 1931.



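6. Recall Exercise 5* and show that the expansion of the function $xu_n$ [where $u_n$ is given by (33)] in terms of the set (33) is

$$xu_n = \sqrt{\tfrac12 n}\;u_{n-1} + \sqrt{\tfrac12(n+1)}\;u_{n+1}. \tag{34}$$

In a similar way, obtain the expansion

$$\frac{d}{dx}\,u_n = \sqrt{\tfrac12 n}\;u_{n-1} - \sqrt{\tfrac12(n+1)}\;u_{n+1}, \tag{35}$$

and also the expansions for $x^2u_n$ and $(d^2/dx^2)u_n$.

7. According to (21), the coefficient $c_k$ in the expansion of the function $xu_n$ in terms of the orthonormal set (33), that is, in

$$xu_n = \sum_k c_ku_k, \qquad k = 0, 1, 2, \ldots \tag{36}$$

is [remember (29)]

$$c_k = \int \bar{u}_k\,x\,u_n\,đx. \tag{37}$$

Now, comparison of (36) with (34) shows, for example, that

$$c_{n-1} = \sqrt{\tfrac12 n}, \tag{38}$$

so that in view of (37) we have

$$\int \bar{u}_{n-1}\,x\,u_n\,đx = \sqrt{\tfrac12 n}, \tag{39}$$

a result which can be written in the form

$$\text{if } m = n + 1, \text{ then } \int \bar{u}_n\,x\,u_m\,đx = \sqrt{\tfrac12(n+1)}. \tag{40}$$

This calculation shows that, in view of the uniqueness of the expansion of a given function in terms of a given orthogonal set, if the expansion coefficients can be found in some way without recourse to integration (as were the coefficients in (34)), then certain definite integrals can be evaluated by comparing the known coefficients with (12).

Working along these lines with the functions (33), verify the following formulas, in which the symbol ′ stands for the phase factor $\exp[-i(\gamma_n - \gamma_m)]$, so that the entry $\sqrt{\tfrac12 n}\,'$ in the first line of (41) stands for $\sqrt{\tfrac12 n}\,\exp[-i(\gamma_n - \gamma_{n-1})]$, the entry $\sqrt{\tfrac12(n+1)}\,'$ in the second line of (41) stands for $\sqrt{\tfrac12(n+1)}\,\exp[-i(\gamma_n - \gamma_{n+1})]$, and so forth:

$$\int \bar{u}_n\,x\,u_m\,đx = \begin{cases} \sqrt{\tfrac12 n}\,' & \text{if } m = n - 1 \\ \sqrt{\tfrac12(n+1)}\,' & \text{if } m = n + 1 \\ 0 & \text{otherwise.} \end{cases} \tag{41}$$

$$\int \bar{u}_n\,\frac{d}{dx}\,u_m\,đx = \begin{cases} -\sqrt{\tfrac12 n}\,' & \text{if } m = n - 1 \\ \sqrt{\tfrac12(n+1)}\,' & \text{if } m = n + 1 \\ 0 & \text{otherwise.} \end{cases} \tag{42}$$

$$\int \bar{u}_n\,x^2\,u_m\,đx = \begin{cases} \tfrac12\sqrt{(n-1)n}\,' & \text{if } m = n - 2 \\ \tfrac12(2n+1) & \text{if } m = n \\ \tfrac12\sqrt{(n+1)(n+2)}\,' & \text{if } m = n + 2 \\ 0 & \text{otherwise.} \end{cases} \tag{43}$$

$$\int \bar{u}_n\,\frac{d^2}{dx^2}\,u_m\,đx = \begin{cases} \tfrac12\sqrt{(n-1)n}\,' & \text{if } m = n - 2 \\ -\tfrac12(2n+1) & \text{if } m = n \\ \tfrac12\sqrt{(n+1)(n+2)}\,' & \text{if } m = n + 2 \\ 0 & \text{otherwise.} \end{cases} \tag{44}$$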

8. Recall Exercise 6<, and show that for m„ given by (33) 


' = ( — 1 )* j 


( 45 ) 



Chapter II 

REMARKS ON CLASSICAL MECHANICS 


9. Hamiltonian Functions 


According to the classical theory, the one-dimensional motion (which we shall take to be along the x-axis of a Cartesian coordinate frame) of a particle of mass m acted on by a force f is described by the differential equation

$$m\ddot{x} = f, \tag{1}$$

which can be inferred from Newton's laws of motion; the symbol $\ddot{x}$ in (1) stands for $d^2x/dt^2$: we use dotted symbols for total derivatives with respect to the time t. Given a force field, Equation (1) enables us to express the position x of the particle as a definite function of the time, provided the initial conditions of the motion are known, so that the two arbitrary constants in the general solution of (1) can be evaluated.

The force field f is said to be conservative if a function V(x), called the potential function of the force field, or simply the potential, can be found such that¹

$$f = -\frac{dV(x)}{dx}. \tag{2}$$

When a conservative force field is given, the corresponding potential, as found by the use of (2), is indeterminate to the extent of an arbitrary additive constant; this constant can be given a specific value by choosing the origin from which the potential is measured. We restrict ourselves to conservative fields.

If the field is conservative, (1) can be written as

$$m\ddot{x} = -\frac{dV(x)}{dx}, \tag{3}$$

and we get the familiar result that the quantity

$$\tfrac12 m\dot{x}^2 + V(x) \tag{4}$$

is independent of the time, and is thus a constant of motion (or an 'integral' of the motion); (4) is called the total energy, or simply the energy of the particle, and is the sum of the kinetic energy $\tfrac12 m\dot{x}^2$ of the particle, and the potential energy V(x) of the particle. We shall denote the numerical value of the energy by the letter E, although the letter W also is often used for this purpose. The constancy of (4) is referred to as the conservation of the mechanical energy of the particle, and accounts for the use of the term conservative for one-dimensional force fields of type (2).

¹ In the one-dimensional case, ordinary rather than partial derivatives may be used on the right sides of (2) and (3). Equation (2) means essentially that the force acting on the particle depends only on the position of the particle, a condition which, in the one-dimensional case, assures conservation of mechanical energy.
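The text passes from (3) to the constancy of (4) without showing the intermediate step. One standard route, given here only as a sketch and not necessarily the argument the author had in mind, is to multiply (3) through by $\dot{x}$ and recognize a total time derivative:

```latex
\begin{align*}
0 &= \dot{x}\left(m\ddot{x} + \frac{dV(x)}{dx}\right) \\
  &= \frac{d}{dt}\left(\tfrac12 m\dot{x}^2\right) + \frac{dV(x)}{dx}\,\frac{dx}{dt} \\
  &= \frac{d}{dt}\left[\tfrac12 m\dot{x}^2 + V(x)\right].
\end{align*}
```

The last line states that the quantity (4) has a vanishing time derivative, that is, that it is a constant of motion.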

The quantity

$$p = m\dot{x} \tag{5}$$

is called the momentum (of the particle) associated with the Cartesian coordinate x.

We refer to a particle executing one-dimensional motion as a one-dimensional dynamical system, and speak of such quantities as the position x, the velocity v, the momentum p, the energy E, and so forth, as dynamical variables, even though some of these, such as E and $E^2$, may, in fact, be constants. We often substitute the word system for the word particle, and say 'energy of the system' for 'energy of the particle,' and so on; also, we say 'conservative system' for 'system consisting of a particle moving in a conservative field.' There are as many different one-dimensional systems involving a single particle of mass m as there are force fields in which a particle can execute one-dimensional motion.

The energy of a one-dimensional system expressed in terms of x and p (not in terms of x and v) is called the Hamiltonian function of the system, or simply the Hamiltonian of the system, and is denoted by H. In the case of a conservative one-dimensional system, we have, in view of (4) and (5),

$$H = \frac{p^2}{2m} + V(x). \tag{6}$$

To symbolize that the energy of a system is conserved and is numerically equal to E we write:

$$H = E. \tag{7}$$



Equation (1), a single ordinary differential equation of the second order, is equivalent to the system of two partial differential equations of the first order,

$$\frac{\partial H}{\partial p} = \dot{x}, \qquad \frac{\partial H}{\partial x} = -\dot{p}, \tag{8}$$

called Hamilton's canonical equations; these equations follow directly from (1), (5), and (6): $\partial H/\partial p = p/m = \dot{x}$, and $\partial H/\partial x = dV(x)/dx = -f = -m\ddot{x} = -\dot{p}$.
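To see the canonical equations (8) at work, they can be integrated numerically for a specific Hamiltonian. The sketch below is an illustration under assumptions that are not in the text: the potential is taken to be $V(x) = \tfrac12 Kx^2$ with the arbitrary values M = K = 1, and the time stepping uses the leapfrog scheme (the names `hamiltonian`, `step`, and `evolve` are hypothetical).

```python
import math

M, K = 1.0, 1.0  # assumed mass and force constant, chosen for illustration

def hamiltonian(x, p):
    # H = p^2/(2M) + V(x), with the assumed potential V(x) = K x^2 / 2.
    return p * p / (2 * M) + 0.5 * K * x * x

def step(x, p, dt):
    # One leapfrog step of  dx/dt = dH/dp = p/M,  dp/dt = -dH/dx = -K x.
    p -= 0.5 * dt * K * x
    x += dt * p / M
    p -= 0.5 * dt * K * x
    return x, p

def evolve(x, p, dt, steps):
    # Repeated leapfrog steps from the initial condition (x, p).
    for _ in range(steps):
        x, p = step(x, p, dt)
    return x, p

x0, p0 = 1.0, 0.0
x1, p1 = evolve(x0, p0, dt=1e-3, steps=5000)   # total time t = 5
print(x1, p1, hamiltonian(x1, p1) - hamiltonian(x0, p0))
```

For these initial conditions the exact motion is $x = \cos t$, $p = -\sin t$, so the printed position and momentum should be close to $\cos 5$ and $-\sin 5$, while the change in H should remain very small, in keeping with (7).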

If two dynamical variables, say r and s, are of such a nature that the Hamiltonian can be expressed entirely in terms of them, and that when this is done the equations

$$\frac{\partial H}{\partial s} = \dot{r}, \qquad \frac{\partial H}{\partial r} = -\dot{s}, \tag{9}$$

hold, the variables r and s are said to be canonically conjugate. Comparison of (8) and (9) shows that the position x and the momentum p of a particle executing one-dimensional motion are canonically conjugate regardless of the specific form that the potential V(x) may have.

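A classical dynamical variable to which we shall refer early in our study of quantum mechanics is the so-called Poisson bracket of a pair of variables. The Poisson bracket of two dynamical variables, say the variables $\xi$ and $\eta$, is denoted by $[\xi, \eta]$ and in the case of one-dimensional motion is defined as

$$[\xi, \eta] = \frac{\partial\xi}{\partial x}\,\frac{\partial\eta}{\partial p} - \frac{\partial\xi}{\partial p}\,\frac{\partial\eta}{\partial x}. \tag{10}$$

For example, the Poisson bracket of the variables x and $x^2$ is

$$[x, x^2] = \frac{\partial x}{\partial x}\,\frac{\partial x^2}{\partial p} - \frac{\partial x}{\partial p}\,\frac{\partial x^2}{\partial x} = 1 \cdot 0 - 0 \cdot 2x = 0; \tag{11}$$

the Poisson bracket of x and p is

$$[x, p] = \frac{\partial x}{\partial x}\,\frac{\partial p}{\partial p} - \frac{\partial x}{\partial p}\,\frac{\partial p}{\partial x} = 1 \cdot 1 - 0 \cdot 0 = 1; \tag{12}$$

and the Poisson bracket of xp and x is

$$[xp, x] = \frac{\partial (xp)}{\partial x}\,\frac{\partial x}{\partial p} - \frac{\partial (xp)}{\partial p}\,\frac{\partial x}{\partial x} = p \cdot 0 - x \cdot 1 = -x. \tag{13}$$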



Insofar as our discussion is concerned, it is important to remember the definition (10) of the Poisson brackets, but acquaintance with their use in classical mechanics is not essential.
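The definition (10) translates directly into a small numerical routine, which can be used to check the examples (11) to (13) at a sample point. This is a minimal sketch under stated assumptions: the partial derivatives are approximated by central differences, dynamical variables are represented as Python functions of (x, p), and the names `partial` and `poisson_bracket` are illustrative.

```python
def partial(g, x, p, wrt, h=1e-4):
    # Central-difference estimate of dg/dx or dg/dp at the point (x, p).
    if wrt == "x":
        return (g(x + h, p) - g(x - h, p)) / (2 * h)
    return (g(x, p + h) - g(x, p - h)) / (2 * h)

def poisson_bracket(xi, eta, x, p):
    # Definition (10): [xi, eta] = xi_x * eta_p - xi_p * eta_x.
    return (partial(xi, x, p, "x") * partial(eta, x, p, "p")
            - partial(xi, x, p, "p") * partial(eta, x, p, "x"))

X = lambda x, p: x          # the variable x
P = lambda x, p: p          # the variable p
X2 = lambda x, p: x * x     # the variable x^2
XP = lambda x, p: x * p     # the variable xp

x0, p0 = 0.7, -1.3          # an arbitrary sample point
print(poisson_bracket(X, X2, x0, p0))   # compare with (11)
print(poisson_bracket(X, P, x0, p0))    # compare with (12)
print(poisson_bracket(XP, X, x0, p0))   # compare with (13)
```

At the sample point the three brackets should come out close to 0, 1, and $-x_0$, respectively; central differences are exact for these low-degree polynomials apart from rounding.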

Our attention will be focused on dynamical variables which (like x, $x^2$, p, and $ax^2 + bp^2$, where a and b are constants) can be expressed entirely in terms of the variables x and p, that is, which (unlike t, xt, and $pt^2$) do not involve the time t explicitly.

Exercises 

1. Show that 1^;*, =. 4a;p, [/(a?, p), f{x, p)l = 0, [fix), gix)] = 0, and 

IKv), 9 (V)] = 0. 

2. Show that, if the Greek letters are dynamical variables pertaining 
to a one-dimensional system, and c is a constant, then 

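$$
\begin{aligned}
{}[\xi, \eta] &= -[\eta, \xi] \\
{}[\xi, c] &= 0 \\
{}[\xi_1 + \xi_2, \eta] &= [\xi_1, \eta] + [\xi_2, \eta] \\
{}[\xi, \eta_1 + \eta_2] &= [\xi, \eta_1] + [\xi, \eta_2] \\
{}[\xi_1\xi_2, \eta] &= [\xi_1, \eta]\xi_2 + \xi_1[\xi_2, \eta] \\
{}[\xi, \eta_1\eta_2] &= [\xi, \eta_1]\eta_2 + \eta_1[\xi, \eta_2] \\
{}[\xi, [\eta, \zeta]] + [\eta, [\zeta, \xi]] + [\zeta, [\xi, \eta]] &= 0.
\end{aligned}
\tag{14a-g}
$$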

Note the resemblance of these relations to the relations 29* among com- 
mutators of linear operators. 

3. Show that, if $\xi$ is any dynamical variable pertaining to a one-dimensional
system, then $[\xi, p] = \partial\xi/\partial x$, and $[\xi, x] = -\partial\xi/\partial p$, so that, in particular,
Hamilton's canonical equations can be written in the symmetric form

$$ \dot{x} = [x, H], \qquad \dot{p} = [p, H]. \tag{15} $$

4. Show that, if $\xi$ is a dynamical variable which does not explicitly
depend on the time t, then

$$ \dot{\xi} = [\xi, H]. \tag{16} $$

Thus, in particular, $[\xi, H]$ vanishes if, and only if, $\xi$ is a constant of motion.

5. Denote the elevation of a ball thrown vertically upward by x, and
show that (2) holds if the motion is frictionless, but does not hold otherwise.

10. Examples of Classical Motion 

We shall now consider briefly some aspects of the classical 
theory of certain one-dimensional systems that we shall presently 
consider from the quantum-mechanical standpoint, since later it 
will be instructive to compare the respective conclusions of the
two theories. 





The free particle. The simplest one-dimensional problem of 
classical mechanics is that of a free particle, that is, a particle 
not acted on by any forces. In this case, 1® is 

$$ m\ddot{x} = 0, \tag{1} $$

so that

$$ x = At + B, \tag{2} $$

where A and B are arbitrary constants. When the initial conditions
of the motion are

$$ \text{at } t = t_0, \quad x = x_0 \text{ and } p = p_0, \tag{3} $$

Equation (2) becomes

$$ x = \frac{p_0}{m}(t - t_0) + x_0. \tag{4} $$


According to classical theory, the physical meaning, that is, the
experimental implication, of (4) is that, if the position and the
momentum of a free particle of mass m are measured experimentally
(precisely, simultaneously, and without disturbing the
particle) at a time $t_0$ and are found to be $x_0$ and $p_0$, and if another
precise experimental measurement of position is to be made at a
time t, then the result of the latter measurement is certain to be
the number obtained by substituting the numerical values of m,
$t_0$, $p_0$, $x_0$, and t into the right side of (4). [Classical equations
of the type x = F(t) also permit us to form a mental picture of
the manner in which the particle behaves between observations.]
Integrating (1) once, we find that the momentum of the free 
particle is a constant of motion, so that, under the conditions (3), 

$$ p(t) = p_0. \tag{5} $$

In view of 2*, the potential describing the field-free case is a con- 
stant, and we may, without loss of generality, set 

V(x) = 0. (6) 

The total energy then equals the kinetic energy:

$$ H = \frac{1}{2m}p^2 = \frac{1}{2m}p_0^2. \tag{7} $$
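
The prediction of the free-particle solution can be made concrete with a short computation. The sketch below is ours: it evaluates the position x = (p0/m)(t - t0) + x0 and the purely kinetic energy p0^2/2m for illustrative numbers. The function names and the numerical values are assumptions, in arbitrary consistent units.

```python
def free_particle_position(m, x0, p0, t0, t):
    """Position of a free particle at time t, given x0 and p0 at time t0."""
    return (p0 / m) * (t - t0) + x0


def free_particle_energy(m, p0):
    """Total energy of a free particle: kinetic only, since V(x) = 0."""
    return p0**2 / (2 * m)


# Hypothetical initial conditions.
m, x0, p0, t0 = 2.0, 1.0, 4.0, 0.0

x_later = free_particle_position(m, x0, p0, t0, t=3.0)
energy = free_particle_energy(m, p0)
```

With these numbers the particle moves at speed p0/m = 2, so three time units later it is expected at x = 7, and its energy is 4 throughout.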


The linear harmonic oscillator. Next we consider the one-dimensional
system known as a linear harmonic oscillator, that is,
the system consisting of a particle of mass m moving on the
x-axis under the influence of a restoring force that is proportional
to the displacement of the particle from some fixed point on the
axis, say the point x = 0. If the force per unit displacement
is $-k$, the equation of motion is

$$ m\ddot{x} = -kx. \tag{8} $$

Using the abbreviation

$$ \omega = \sqrt{k/m}, \tag{9} $$

we write the general solution of (8) as

$$ x = A\cos\omega t + B\sin\omega t, \tag{10} $$

where A and B are arbitrary constants. The particle thus executes
a simple harmonic motion of frequency

$$ \nu = \frac{\omega}{2\pi} = \frac{1}{2\pi}\sqrt{\frac{k}{m}}. \tag{11} $$

If the initial conditions are (3), Equation (10) becomes

$$ x = x_0\cos\omega(t - t_0) + \frac{p_0}{m\omega}\sin\omega(t - t_0), \tag{12} $$

and the momentum at the time t is

$$ p = -x_0\omega m\sin\omega(t - t_0) + p_0\cos\omega(t - t_0). \tag{13} $$
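
As a check on the solution for given initial conditions, one may evaluate x and p at several times and confirm that the energy p^2/2m + kx^2/2 keeps its initial value. The following is a minimal sketch of our own; the function names and the numerical values are illustrative assumptions.

```python
import math


def oscillator_state(m, k, x0, p0, t0, t):
    """Return (x, p) at time t for an oscillator with x0, p0 given at t0."""
    omega = math.sqrt(k / m)
    angle = omega * (t - t0)
    x = x0 * math.cos(angle) + (p0 / (m * omega)) * math.sin(angle)
    p = -x0 * omega * m * math.sin(angle) + p0 * math.cos(angle)
    return x, p


def oscillator_energy(m, k, x, p):
    """Kinetic plus potential energy of the oscillator."""
    return p**2 / (2 * m) + 0.5 * k * x**2


# Hypothetical parameters and initial conditions (arbitrary units).
m, k, x0, p0, t0 = 2.0, 8.0, 0.5, 1.0, 0.0

initial_energy = oscillator_energy(m, k, x0, p0)
energies = [
    oscillator_energy(m, k, *oscillator_state(m, k, x0, p0, t0, t))
    for t in (0.0, 0.3, 1.7, 10.0)
]
```

Each entry of energies should equal initial_energy to within rounding, which is one way of seeing that the energy is a constant of this motion.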

Although (12) exhibits x as a function of the time in a most
explicit fashion, another expression for x will also be of interest
to us. Our system is conservative, and the force $-kx$ is the
negative gradient of the potential function

$$ V(x) = \tfrac{1}{2}kx^2, \tag{14} $$

the arbitrary constant in V(x) having been set equal to zero, as
we shall usually do in studying the oscillator. The Hamiltonian
of the system is

$$ H = \frac{1}{2m}p^2 + \tfrac{1}{2}kx^2, \tag{15} $$

so that, if the energy of the oscillator has the numerical value E,
we have

$$ \frac{1}{2m}p^2 + \tfrac{1}{2}kx^2 = E. \tag{16} $$





Now, the general solution (10) can be rearranged so that E
appears as one of two arbitrary constants. Let a constant $\tau$ be
defined by $\sin\omega\tau = A/(A^2 + B^2)^{1/2}$; then (10) becomes
$x = (A^2 + B^2)^{1/2}\sin\omega(t + \tau)$, and consequently
$p = m\omega(A^2 + B^2)^{1/2}\cos\omega(t + \tau)$; substitution of these expressions
for x and p into (16) yields $A^2 + B^2 = 2E/k = 2E/m\omega^2$, and we finally get

$$ x = \sqrt{\frac{2E}{k}}\,\sin\omega(t + \tau) \tag{17} $$

and

$$ p = \sqrt{2mE}\,\cos\omega(t + \tau). \tag{18} $$

The two arbitrary constants in (17) are E and $\tau$; when the general
solution of the equations of motion of a dynamical system is
written in such a way that one of the arbitrary constants, say $\tau$,
and the time t appear only in the combination $t + \tau$, then $\tau$ is
called the phase of the motion.
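
The passage from the constants A and B to the constants E and the phase can be sketched numerically. The code below is ours and rests on one assumption beyond the text: the quadrant of the phase angle is fixed by requiring cos(omega*tau) = B/(A^2 + B^2)^(1/2) in addition to the stated condition on sin(omega*tau), which is what math.atan2 does. All names and numbers are illustrative.

```python
import math


def energy_and_phase(m, k, A, B):
    """Convert the constants (A, B) of x = A cos(wt) + B sin(wt) to (E, tau)."""
    omega = math.sqrt(k / m)
    E = 0.5 * k * (A**2 + B**2)  # from A^2 + B^2 = 2E/k
    # Assumption: sin(omega*tau) = A/R and cos(omega*tau) = B/R, R = sqrt(A^2 + B^2).
    tau = math.atan2(A, B) / omega
    return E, tau


def x_from_constants(m, k, A, B, t):
    """Displacement written with the arbitrary constants A and B."""
    omega = math.sqrt(k / m)
    return A * math.cos(omega * t) + B * math.sin(omega * t)


def x_from_energy_phase(m, k, E, tau, t):
    """Displacement written with the energy E and the phase tau."""
    omega = math.sqrt(k / m)
    return math.sqrt(2 * E / k) * math.sin(omega * (t + tau))


# Hypothetical values: A^2 + B^2 = 0.25, so E = 0.5 for k = 4.
m, k, A, B = 1.0, 4.0, 0.3, -0.4
E, tau = energy_and_phase(m, k, A, B)
differences = [
    x_from_constants(m, k, A, B, t) - x_from_energy_phase(m, k, E, tau, t)
    for t in (0.0, 0.4, 1.3, 7.0)
]
```

The two forms of the solution should agree at every instant, so each entry of differences is expected to vanish to within rounding.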

Incidentally, Equation (17) shows that according to classical
theory a precise experimental measurement of the position of a
particle comprising a linear harmonic oscillator of energy E cannot
yield a result lying outside the region $-\sqrt{2E/k} \le x \le \sqrt{2E/k}$;
for this reason the positions $-\sqrt{2E/k}$ and $\sqrt{2E/k}$ are called the
classical limits of x of the oscillator. The momenta $-\sqrt{2mE}$
and $\sqrt{2mE}$ are, for a similar reason, called the classical limits
of p of an oscillator.

We often find it convenient to represent the potential V(x) of
a one-dimensional system and the energy E of the system in a
single graph, E being represented by a horizontal line. Such an
energy diagram for a linear harmonic oscillator is shown in Fig. 7,
where the parabola is the potential function $\tfrac{1}{2}kx^2$, and where
$x_1$ and $x_2$ are the classical limits of motion. Note that an energy
diagram makes no reference to the time t.

Potential wells. Let three metallic cylinders (the two outer ones
infinitely long) and the fine grids across their ends be maintained at
the electric potentials indicated in Fig. 8(a); the potential field,
Fig. 8(b), acting on a particle carrying a unit positive charge and
moving along the axis of the cylinders then has the form of a well.
If the members of each pair of adjacent grids are imagined to
approach each other, we get the idealized potential of Fig. 9,
sometimes called a (one-dimensional) rectangular potential well;
the vertical lines at $x = \pm a$ in Fig. 9 are drawn to help the eye.
If the potential $V_0$ in Fig. 9 is increased indefinitely, we get the
potential describing what is often called a (one-dimensional) box
with perfectly reflecting walls; this field is usually represented
graphically as in Fig. 10, where the 'box' extends from x = 0 to x = l.

Fig. 8. A potential well.

Fig. 9. A rectangular potential well.

Exercises 

1. What are the experimental implications of the classical results
(13) and (16)?

2. Evaluate $\tau$ in (17) if at t = 0 the particle is at x = 0 and is
moving toward the right.





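3. Show that the coordinate x, the momentum p, and the variables
$x^2$, $p^2$, and xp for a linear harmonic oscillator can be expressed as functions
of t by means of the following Fourier series, in which $\gamma$ (= $\omega\tau$) is a real
constant:

$$ x = \left[-i\sqrt{\frac{E}{2k}}\,e^{i\gamma}\right]e^{i\omega t} + \left[i\sqrt{\frac{E}{2k}}\,e^{-i\gamma}\right]e^{-i\omega t} \tag{19} $$

$$ p = \left[\sqrt{\frac{mE}{2}}\,e^{i\gamma}\right]e^{i\omega t} + \left[\sqrt{\frac{mE}{2}}\,e^{-i\gamma}\right]e^{-i\omega t} \tag{20} $$

$$ x^2 = \left[-\frac{E}{2k}\,e^{2i\gamma}\right]e^{2i\omega t} + \frac{E}{k} + \left[-\frac{E}{2k}\,e^{-2i\gamma}\right]e^{-2i\omega t} \tag{21} $$

$$ p^2 = \left[\tfrac{1}{2}mE\,e^{2i\gamma}\right]e^{2i\omega t} + mE + \left[\tfrac{1}{2}mE\,e^{-2i\gamma}\right]e^{-2i\omega t} \tag{22} $$

$$ xp = \left[-i\frac{E}{2\omega}\,e^{2i\gamma}\right]e^{2i\omega t} + \left[i\frac{E}{2\omega}\,e^{-2i\gamma}\right]e^{-2i\omega t} \tag{23} $$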


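4. Show that, if a particle of mass m and energy E moves in the
one-dimensional box of Fig. 10, then

$$ x = \tfrac{1}{2}l - \frac{2li}{\pi^2}\sum_k \frac{(-1)^k}{(2k+1)^2}\,e^{(2k+1)i(\omega t + \gamma)}, \qquad k = \cdots, -1, 0, 1, 2, \cdots, \tag{24} $$

where $\omega = (\pi/l)\sqrt{2E/m}$, and where $\gamma = \omega\tau$, $\tau$ being the phase. Evaluate $\gamma$
if at t = 0 the particle is at $x = \tfrac{1}{2}l$ and is moving toward the right.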


11. Distribution Functions 

The type of experiment that is in some respects most informative
consists in determining experimentally the condition of a
system at an instant $t_0$ and then measuring the value of some
dynamical variable at a later instant t. In order that a theoretical
calculation may be carried out to the point at which it
becomes directly useful in interpreting the results of such an
experiment, the person performing the theoretical computations
must be informed, first of all, as to what kind of a system is under
consideration, so that he may properly choose the force field f
in the fundamental equation $m\ddot{x} = f$; secondly, he must be given
information regarding the condition of the system at the instant
$t_0$, or, as we shall say, the state of the system must be specified
for him. In classical mechanics, the standard method of specifying
the state of a system consists in quoting the initial conditions
of the motion, as was done in 3¹⁰, or by providing information
from which $x_0$ and $p_0$ can be unambiguously determined. A
self-consistent set of data is called a maximal set if it describes the
condition of the system at some instant in all theoretically attainable
detail; according to classical theory, the initial conditions 3¹⁰
are an example of a maximal set of conditions for a particle
executing one-dimensional motion.

If the information concerning the condition of the system at
the instant $t_0$ available to the experimentalist does not constitute
a maximal set of conditions, he will know that the results of at
least some of the experiments which might be performed at the
instant t are not certainties, and will not expect the theoretical
computer to provide unambiguous answers to all questions; again,
the computer will not be able to carry out his work entirely on
the basis of the laws of motion, but will have to be guided also
by considerations of probabilities.

To illustrate: We are requested to provide theoretical data concerning
the outcome of a precise measurement, to be made at the instant t, of the
momentum p of a particle of mass m moving in the potential box of Fig. 10,
and we are told that

(a) the energy of the particle is certainly E,
(b) no other information is available.                              (1)

Under these circumstances, we are obviously not able to compute the
result of the contemplated measurement with certainty; yet we can tell
what the possible results and what their respective probabilities are. In
fact, we can draw from the available information the conclusions
concerning the outcome of a precise measurement of the momentum at the
time t summarized in Table 1.

Table 1

Possible results of a precise measurement of p    $\sqrt{2mE}$      $-\sqrt{2mE}$
Respective probabilities                          $\tfrac{1}{2}$    $\tfrac{1}{2}$

Discrete distributions. Let the different possible results of an
experiment, yet to be performed and designed to measure precisely
the value of a dynamical variable, say s, be $s_1$, $s_2$, $s_3$, and
so forth, and let the respective probabilities of getting these
results be $P(s_1)$, $P(s_2)$, $P(s_3)$, and so forth. Then the list of the
s's and the P's shown in Table 2 is called the expected absolute
distribution function of s, or the distribution function of s, or
simply the distribution-in-s.

Table 2

Possible results of a precise measurement of s    $s_1$       $s_2$       $s_3$       ···
Respective probabilities                          $P(s_1)$    $P(s_2)$    $P(s_3)$    ···

Table 1, for example, is the expected
absolute distribution function of the momentum of our boxed
particle, or the distribution function of the momentum of the
particle, or simply the distribution-in-p of the particle. The word
expected appears in the full title of the term to suggest that the
measurement is yet to be performed, a circumstance that must be
kept in mind when the abbreviated terminology is employed.*
The word absolute is used to distinguish the present case from
relative distribution functions to be discussed later. If the possible
results of a precise measurement of s, or, as we shall often say for
short, 'the possible values of s,' are separated from each other by
finite amounts, then the distribution-in-s is said to be discrete.
The concept of a distribution-in-s implies, of course, that s can be
measured with absolute precision. The probabilities $P(s_i)$ occurring
in an absolute distribution satisfy the relation

$$ \sum_i P(s_i) = 1, \qquad i = 1, 2, 3, \cdots. \tag{2} $$

A quantity of considerable interest to us is

$$ \sum_i s_i P(s_i), \qquad i = 1, 2, 3, \cdots, \tag{3} $$

called the mathematical expectation of s, or simply the expectation
of s. The expectation of s is the arithmetic mean of the results
that our contemplated precise measurement of s might yield, each
result being weighted by its estimated probability. Thus the
expectation of the momentum of our particle in the box is,
according to Table 1,

$$ (\sqrt{2mE})(\tfrac{1}{2}) + (-\sqrt{2mE})(\tfrac{1}{2}) = 0; \tag{4} $$

this example shows that the expectation of a quantity need not
itself be a possible result of a precise measurement of the quantity.
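
The expectation of a discrete distribution is simple to compute mechanically. The sketch below is ours: it represents a distribution as a list of (value, probability) pairs, checks that the probabilities add up to unity, and forms the probability-weighted sum for the momentum of the boxed particle. The function name, the list representation, and the numerical values of m and E are illustrative assumptions.

```python
import math


def expectation(distribution):
    """Probability-weighted mean of a list of (value, probability) pairs."""
    total = sum(P for _, P in distribution)
    if abs(total - 1.0) > 1e-12:
        raise ValueError("probabilities of an absolute distribution must sum to 1")
    return sum(s * P for s, P in distribution)


# Hypothetical mass and energy (arbitrary units); sqrt(2mE) = 2 here.
m, E = 1.0, 2.0
p_limit = math.sqrt(2 * m * E)

# Distribution-in-p of the boxed particle when only E is known.
table_1 = [(p_limit, 0.5), (-p_limit, 0.5)]
av_p = expectation(table_1)
```

Here av_p comes out as zero, although zero is not itself a possible result of a precise measurement of p.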

* The term distribution function is used also to denote a list of experimental
values and of the frequencies of occurrence of the individual values
(or values in specified ranges) found by a series of experiments on a single 
system or on an assemblage of similar systems; we shall have no occasion to 
use the term in these senses, although in a rigorous formulation of quantum 
mechanics the consideration of assemblages is quite important. 





Although the terms mathematical expectation of s or expectation
of s seem to be the only ones that can be legitimately used for the
quantity (3), it has become a custom in quantum-mechanical
literature to speak of (3) as the average of s, even though the
unqualified term average usually refers to the arithmetic mean of
already available experimental data, rather than to the
probability-weighted mean of the possible results of a single experiment
yet to be performed on a single particle; accordingly, we shall, as
a rule, speak of (3), with mental reservations, as the average of s,
and shall abbreviate it by the symbols av s or $\bar{s}$:

$$ \text{av } s = \bar{s} = \sum_i s_i P(s_i), \qquad i = 1, 2, 3, \cdots. \tag{5} $$

As a measure of the inexactitude of the information concerning
s contained in a distribution-in-s, we shall adopt the quantity

$$ \Delta s = \sqrt{\sum_i (s_i - \text{av } s)^2 P(s_i)}, \qquad i = 1, 2, 3, \cdots, \tag{6} $$

that is, the probability-weighted root-mean-square of the deviations
of the possible values of s from av s; we shall call $\Delta s$ the expected
inexactitude of s, or simply the inexactitude of s,* and shall use
the symbol $\Delta_s$ interchangeably with $\Delta s$. The inexactitude of the
momentum of our boxed particle, for example, is

$$ \Delta p = \sqrt{(+\sqrt{2mE} - 0)^2(\tfrac{1}{2}) + (-\sqrt{2mE} - 0)^2(\tfrac{1}{2})} = \sqrt{2mE}. \tag{7} $$

Since $\Delta s$ as defined by (6) cannot vanish unless every possible
value of s equals av s, the vanishing of $\Delta s$ implies that only one
value of s is possible, and that hence the result of a precise
measurement of s is a certainty. Whether the value of s is a certainty
is shown of course by the distribution-in-s; but sometimes it is
possible to determine $\Delta s$ without the knowledge of P(s).
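
The inexactitude can be computed in the same mechanical way as the expectation. The following sketch is ours: it evaluates the probability-weighted root-mean-square deviation for the momentum of the boxed particle and for a distribution with a single possible value. The function names, the list-of-pairs representation, and the numbers are illustrative assumptions.

```python
import math


def average(distribution):
    """Probability-weighted mean of (value, probability) pairs."""
    return sum(s * P for s, P in distribution)


def inexactitude(distribution):
    """Probability-weighted root-mean-square deviation from the average."""
    av = average(distribution)
    return math.sqrt(sum((s - av) ** 2 * P for s, P in distribution))


# Hypothetical mass and energy (arbitrary units); sqrt(2mE) = 2 here.
m, E = 1.0, 2.0
p_limit = math.sqrt(2 * m * E)

delta_p = inexactitude([(p_limit, 0.5), (-p_limit, 0.5)])
delta_certain = inexactitude([(3.0, 1.0)])  # only one possible value
```

The first result should equal sqrt(2mE); the second vanishes, in keeping with the remark that a vanishing inexactitude means the result of the measurement is a certainty.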

If the distribution-in-s is available, then the possible results of a
precise measurement of a variable f(s) and their respective
probabilities, that is, the distribution-in-f(s), can be calculated from
the distribution-in-s. The expectation of f(s) can be computed
directly in terms of the distribution-in-s:

$$ \text{av } f(s) = \sum_i f(s_i) P(s_i), \qquad i = 1, 2, 3, \cdots. \tag{8} $$

Thus the expected average of the square of the momentum of our
particle in the box is

$$ \text{av } p^2 = (\sqrt{2mE})^2(\tfrac{1}{2}) + (-\sqrt{2mE})^2(\tfrac{1}{2}) = 2mE; \tag{9} $$

this example shows that in using (8) we must sum over all possible
values of s even when the inverse of f(s) is not single-valued, that
is, even when $f(s_i)$ and $f(s_j)$ are not necessarily different if i and j
are different.

* The quantity (6) goes by several names; in statistics it is called the
standard deviation; in quantum mechanics it is called the indeterminacy
of s or the indefiniteness of s.
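
The two routes to the average of a function of s can be compared in a few lines. The sketch below is ours: average_of sums f(s) times P(s) over all possible values of s, while distribution_in_f first merges the probabilities of values of s that give the same f(s). The function names, the list-of-pairs representation, and the numbers are illustrative assumptions.

```python
import math
from collections import defaultdict


def average_of(f, distribution):
    """Average of f(s), summing over every possible value of s."""
    return sum(f(s) * P for s, P in distribution)


def distribution_in_f(f, distribution):
    """Distribution-in-f(s): probabilities of equal f-values are added."""
    merged = defaultdict(float)
    for s, P in distribution:
        merged[f(s)] += P
    return sorted(merged.items())


# Hypothetical mass and energy (arbitrary units); sqrt(2mE) = 2 here.
m, E = 1.0, 2.0
p_limit = math.sqrt(2 * m * E)
table_1 = [(p_limit, 0.5), (-p_limit, 0.5)]


def square(p):
    return p * p


av_p2 = average_of(square, table_1)
dist_p2 = distribution_in_f(square, table_1)
```

Both possible momenta give the same square, so the distribution-in-p^2 has the single value 2mE with probability 1, and its average agrees with the direct sum over the two values of p.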


Continuous distributions. To introduce the so-called continuous
distributions, let us consider, on the basis of the information (1), the result of
an experiment designed to measure at the time t the precise value of the
coordinate x of the particle moving in the box of Fig. 10, redrawn in Fig.
11(a). A relation of the type x = x(t) cannot be set up, but several
significant conclusions regarding the outcome of the experiment can nevertheless
be drawn. In the first place, since the particle never leaves the box, the
probability of finding it in any region which lies wholly outside the box,
that is, outside the interval 0 < x < l, is zero. Further, although the
experiment will certainly locate the particle at some one point within the
box, the probability of locating the particle at any one preassigned point
is zero, since the number of points at which the particle might be located
is infinitely great. But the probability of locating the particle between
any two preassigned noncoincident points within the box is finite, and
we may compute the probability of locating it between x and x + dx, or,
as we shall say for short, of locating it in dx at x. Since the particle moves
with a constant speed inside the box, and hence spends equal intervals of
time in regions of equal length, the probability of finding it in dx at x
inside the box is proportional to dx and is independent of x. The
probability of finding the particle in dx at x, which we denote by P(x)dx, is
consequently of the form

$$ P(x)\,dx = \begin{cases} 0 & \text{for } x < 0 \\ C\,dx & \text{for } 0 < x < l \\ 0 & \text{for } l < x, \end{cases} \tag{10} $$

where C is a constant. To evaluate C we note that according to (10) the
probability of locating the particle somewhere is

$$ \int_{-\infty}^{\infty} P(x)\,dx = \int_0^l C\,dx = Cl, \tag{11} $$

and that, since the particle will certainly be found somewhere, we must
have Cl = 1; hence

$$ P(x) = \begin{cases} 0 & \text{for } x < 0 \\ 1/l & \text{for } 0 < x < l \\ 0 & \text{for } l < x, \end{cases} \tag{12} $$

and P(x) has the form shown in Fig. 11(b).

Fig. 11. A potential box. (a) The potential. (b) The distribution-in-x
under the conditions (1).


When the probability that a precise measurement of s will yield
a result lying in ds at s is written in the form P(s)ds,
the function P(s) is called the expected absolute distribution function
of s, or simply the distribution-in-s. This terminology is an
extension of that given before; but note that in the case of a
discrete distribution P(s) is itself a probability and hence a pure
number, while if the distribution is continuous P(s) must be multiplied
by ds before a probability is obtained, so that, in the continuous
case, P(s) has the physical dimensions of 1/s.

If the distribution-in-s is continuous, the following formulas
take the places of (2), (5), (6), and (8) [recall the convention
stated above Equation 29®]:

$$ \int P(s)\,ds = 1, \tag{13} $$

$$ \text{av } s = \int sP(s)\,ds, \tag{14} $$

$$ \Delta s = \sqrt{\int (s - \text{av } s)^2 P(s)\,ds}, \tag{15} $$

$$ \text{av } f(s) = \int f(s)P(s)\,ds. \tag{16} $$




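In the case of our boxed particle, we have, for example,

$$ \text{av } x = \int_{-\infty}^{\infty} xP(x)\,dx = \frac{1}{l}\int_0^l x\,dx = \tfrac{1}{2}l, \tag{17} $$

and

$$ \Delta_x^2 = \int_{-\infty}^{\infty} (x - \text{av } x)^2 P(x)\,dx = \frac{1}{l}\int_0^l (x - \tfrac{1}{2}l)^2\,dx = \tfrac{1}{12}l^2, \tag{18} $$

so that

$$ \Delta x = \frac{1}{\sqrt{12}}\,l. \tag{19} $$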
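
The integrals for the boxed particle can also be estimated numerically, which may help when P(x) is less simple than a constant. The sketch below is ours: it applies a midpoint rule to the uniform distribution 1/l inside the box and recovers the normalization, the average of x, and the inexactitude. The box length, the number of panels, and all names are illustrative assumptions.

```python
import math


def box_density(x, l):
    """Uniform absolute distribution-in-x for a box extending from 0 to l."""
    return 1.0 / l if 0.0 < x < l else 0.0


def integrate(g, a, b, n=20000):
    """Midpoint-rule estimate of the integral of g from a to b (n panels)."""
    h = (b - a) / n
    return h * sum(g(a + (j + 0.5) * h) for j in range(n))


l = 2.0  # assumed box length, arbitrary units

normalization = integrate(lambda x: box_density(x, l), 0.0, l)
av_x = integrate(lambda x: x * box_density(x, l), 0.0, l)
variance = integrate(lambda x: (x - av_x) ** 2 * box_density(x, l), 0.0, l)
delta_x = math.sqrt(variance)
```

The estimates should reproduce, to within the accuracy of the midpoint rule, a normalization of 1, an average of l/2, and an inexactitude of l/sqrt(12).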


Relative distributions. The possible values of a variable may 
comprise a set that is in part discrete and in part continuous, a 
case which we shall not discuss here. But we shall illustrate the 
so-called relative distribution functions, which are to be distinguished
from the absolute distribution functions discussed so far.

We may be informed, for example, that a particle of mass m
and total energy E, $E > V_0$, moves classically in the potential
field of Fig. 9, redrawn in Fig. 12(a), and may wish to investigate,
on the basis of this information alone, the result of an
experiment designed to measure precisely the coordinate
x of the particle at the time t. The probability that x will
be found to have any one preassigned value is then zero, just as
in the case of a particle in a box; but, in addition, the probability
that x will be found to have a value in any preassigned
finite interval is also zero, since an infinite region is accessible
to the particle. Hence the absolute distribution function P(x)
defined above vanishes everywhere.

Fig. 12. A potential well. (a) The energy diagram for the case $E > V_0$.
(b) The corresponding relative distribution-in-x when only E is known.


Now, the distribution function P(x) = 0 contains significant
information regarding our particle; yet it fails to display
an important fact: when the particle is in the middle region,
where V(x) = 0, its speed is $\sqrt{2E/m}$, but when it is in an outer
region its speed is $\sqrt{2(E - V_0)/m}$, so that the time spent by the
particle in any interval in the middle region is shorter, in the
ratio $\sqrt{(E - V_0)/E}$, than the time spent in an interval of equal
length but lying in an exterior region.

In cases of this kind, it is advantageous to abandon the absolute
distribution function P(s) and employ instead a function (denoted
by P(s) also) such that P(s)ds is proportional to, rather than
equal to, the probability that s have a value in ds at s. In our
specific example we may choose the relative probability P(x) to
be some nonvanishing constant C in the middle region, and then
its value everywhere else will be $C\sqrt{E/(E - V_0)}$; this relative
distribution-in-x is shown in Fig. 12(b).
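
The relative distribution for the well can be written as a short function. The sketch below is ours and assumes that the well of Fig. 9 has V = 0 for |x| < a and V = V0 elsewhere; the assignment of the boundary points x = ±a to the outer region is arbitrary. It also checks that the relative density is inversely proportional to the classical speed. All names and numbers are illustrative.

```python
import math


def relative_density(x, a, V0, E, C=1.0):
    """Relative distribution-in-x for a particle of energy E > V0."""
    if E <= V0:
        raise ValueError("this sketch covers only the case E > V0")
    if abs(x) < a:
        return C  # middle region, where V(x) = 0
    return C * math.sqrt(E / (E - V0))  # outer regions, where V(x) = V0


def speed(x, a, V0, E, m):
    """Classical speed of the particle at x (same assumed potential)."""
    V = 0.0 if abs(x) < a else V0
    return math.sqrt(2.0 * (E - V) / m)


# Hypothetical parameters (arbitrary units).
a, V0, E, m = 1.0, 3.0, 4.0, 2.0

inside = relative_density(0.0, a, V0, E)
outside = relative_density(5.0, a, V0, E)
speed_ratio = speed(0.0, a, V0, E, m) / speed(5.0, a, V0, E, m)
```

For these numbers the particle moves twice as fast in the middle region as outside it, and the relative density outside is correspondingly twice that inside.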

If the only information available regarding a free particle
moving classically along the x-axis is that its kinetic energy is E,
then the relative distribution-in-x can be taken as P(x) = C, C
being an arbitrary nonvanishing constant, preferably a positive
one; if the momentum (rather than the kinetic energy) of a free
particle, and nothing else, is specified, the relative distribution-in-x
is again P(x) = C; this distribution is shown in Fig. 13. When
an absolute distribution function is multiplied by a nonvanishing
constant other than 1, the result is a relative distribution function
of the same variable.

Stationary distributions. In dis- 
cussing, under the conditions (1), 
the outcome of an experiment de- 
signed to measure the position of 
a particle in a box, we specified 
that the measurement is to be performed at the time t; yet the
distribution function (12) is independent of t. Whenever a distribution-in-s
is independent of the time at which the contemplated
measurement of s is to be made, the distribution is said to be 
stationary. The distribution-in-a; given by (12) and the distribu- 
tion-in-p given by Table 1 are examples of stationary distributions. 

A uniqueness theorem. When a distribution-in-s is given, then
the averages of all integral powers of s (that is, av $s^k$, k = 1,
2, ···) are uniquely determined; in the case of the simpler
distributions, the converse is also true, namely, the aggregate of the
averages of all positive integral powers of s determines (at least in
principle) the distribution-in-s uniquely.* In what follows, we
shall sometimes be given the values of av $s^k$ for k = 1, 2, 3, ···,
and shall be able to identify one distribution P(s) yielding just the
correct averages; the uniqueness theorem quoted above will then
enable us to conclude that this P(s) is the correct distribution.

Fig. 13. The relative distribution-in-x for a free particle when only E
(or only p) is known.

The state of a system. We remarked earlier that both the
kind and the state of a dynamical system must be specified in order
that certain theoretical calculations may be carried out, and we
noted that the standard method of specifying the state in classical
mechanics consists in quoting the initial conditions of the motion.
To make the term state, as we shall use it, more precise, we shall
now agree on the following definition: the state of a dynamical
system at an instant t is specified by whatever information is available
concerning the system at the instant t, apart from the information
that the system is of a certain kind. In practice, then, the state of
the system at some instant $t_0$ is usually specified by experimental
data; and the state at a later instant t is specified by whatever
information concerning the system at this instant can be
inferred, with the help of the laws of motion, from that relating
to the instant $t_0$.

The state of an isolated system at any instant t can always be
inferred from that at an earlier instant $t_0$, and consequently the
specification of the state for an instant $t_0$ is equivalent to specifying
the state for all time;† hence we may, for brevity, suppress the
mention of a particular instant of time when speaking of a state of a
system, it being understood that such a mention would be made
in a more careful statement.

An essential feature of our definition of the term state is that it
does not require that the information concerning the system
constitute a maximal set of conditions. For example, we can say
that the state of the particle in the box is specified by the
statements (1), and can refer to (12) as 'the distribution-in-x for a
boxed particle in the state specified by (1),' even though the
conditions (1) are obviously not maximal. The reader will probably
not deny the convenience of this terminology; yet our use of the
term state may seem to him unattractive, and he may prefer to hold
that the state of a system is specified if, and only if, a maximal set
of conditions is given, that is, when, according to classical
mechanics, the result of every precise measurement can be computed
with certainty. We shall find later, however, that conditions which
are maximal according to quantum mechanics are not maximal
classically, and leave the results of some precise measurements
uncertain.

* The problem of inferring the distribution function from the averages
of the integral powers of the variable is discussed, for example, in J. V.
Uspensky's Introduction to Mathematical Probability (hereafter cited as
Uspensky), Appendix II; New York: McGraw-Hill Book Co. (1937).

† We disregard the possibility that the computations may be upset by
the arrival of delayed information.


Exercises

1. Show that for either a discrete or a continuous distribution

$$ \Delta_s^2 = \text{av } s^2 - \text{av}^2 s, \tag{20} $$

where $\Delta_s^2$ is the square of the inexactitude of s, av $s^2$ is the average of
the square of s, and av$^2$ s is the square of the average of s. In computing
$\Delta s$ it is often expedient to use (20) rather than (6) or (15).

2. Show that, if the distribution-in-s is discrete and $c \ne \text{av } s$, then
$\sum_i (s_i - c)^2 P(s_i) > \Delta_s^2$; prove the corresponding inequality for
continuous distributions.

3. One of the most important distributions, the Gaussian or normal
distribution, is

$$ P(s) = \frac{1}{\sigma\sqrt{2\pi}}\,e^{-(s - m)^2/2\sigma^2}, \tag{21} $$

where the variable s ranges from $-\infty$ to $\infty$, m is a real constant, and $\sigma$
is a positive constant. Show that for this distribution av s = m and
$\Delta s = \sigma$, so that we may write (21) as (see Fig. 14)

$$ P(s) = \frac{1}{\Delta_s\sqrt{2\pi}}\,e^{-(s - \bar{s})^2/2\Delta_s^2}. \tag{22} $$

Note that (22) inflects at $s = \bar{s} - \Delta_s$ and at $s = \bar{s} + \Delta_s$.

Fig. 14. The Gaussian distribution. The dotted lines mark the inflections
at $\bar{s} \pm \Delta_s$.

4. Show that the distribution function*

$$ P(s) = \frac{b}{\pi[b^2 + (s - m)^2]} \tag{23} $$

(where the variable s ranges from $-\infty$ to $\infty$, m is a real constant, and b
is a positive constant) satisfies (13), and that its graph resembles that
of (21). Then use (15) to show that, when the distribution-in-s is (23), the
inexactitude of s is infinite and that hence the average value of s is not
significantly representative of the distribution.

* Uspensky, page 242.

5. Show that, if the distribution-in-x is (12), then the average of
$x^k$ is $l^k/(k + 1)$, and [here (20) is helpful] the inexactitude of $x^k$ is
$k(k + 1)^{-1}(2k + 1)^{-1/2}\,l^k$.

6. Let P(s) be a continuous distribution-in-s, and let P[f(s)] be the
corresponding distribution-in-f(s), so that P[f(s)] df(s) is the probability
that f(s) will have a value between f(s) and f(s) + df(s); show that then

$$ P[f(s)] = P(s)/|f'(s)|. \tag{24} $$

Indicate how P[f(s)] can be expressed as a function of f(s), rather than
a function of s, when the inverse of f(s) is single-valued, and also when
the inverse of f(s) is not single-valued.

7. Show that, if the distribution-in-x is (12), then the distribution-in-u,
where $u = x^k$ and k is a positive integer, vanishes outside the region
$0 < u < l^k$, and is $P(u) = (kl)^{-1}u^{(1/k) - 1}$ within. Use this distribution to
verify the results of Exercise 5.

8. Let the only information available concerning a classical oscillator
($V = \tfrac{1}{2}kx^2$) be that its energy is certainly E. Show that then the
absolute distribution-in-x at the time t is (see Fig. 15)

$$ P(x) = \begin{cases} 0 & \text{for } x < -\sqrt{2E/k} \\ \dfrac{1}{\pi\sqrt{2E/k - x^2}} & \text{for } -\sqrt{2E/k} < x < \sqrt{2E/k} \\ 0 & \text{for } \sqrt{2E/k} < x, \end{cases} \tag{25} $$

and that av x = 0, av $x^2$ = E/k, and $\Delta x = \sqrt{E/k}$.

Fig. 15. The distribution-in-x (25) for an oscillator (heavy lines).

9. Show that for the oscillator of Exercise 8 the absolute
distribution-in-p at the time t is (see Fig. 16)

$$ P(p) = \begin{cases} 0 & \text{for } p < -\sqrt{2mE} \\ \dfrac{1}{\pi\sqrt{2mE - p^2}} & \text{for } -\sqrt{2mE} < p < \sqrt{2mE} \\ 0 & \text{for } \sqrt{2mE} < p, \end{cases} \tag{26} $$

and that av p = 0, av $p^2$ = mE, and $\Delta p = \sqrt{mE}$; note also that
$\Delta x\,\Delta p = E\sqrt{m/k}$.

10. Explain why the following argument is fallacious: 'The speed of a
particle executing simple harmonic motion is small when the displacement
is large, and is large when the displacement is small; according to Fig. 15,
large displacements are more probable than small ones; hence small speeds
are more probable than large ones, and P(p) should have a maximum at
p = 0, rather than a minimum as indicated in Fig. 16.'

11. Show that the expected averages of the potential and the kinetic
energies of the oscillator of Exercise 8 are each $\tfrac{1}{2}E$.

12. Let s be a dynamical variable pertaining to a one-dimensional
system, and let only the energy E of the system be known. Show that then
the expected average of s at the time t can be obtained by expressing s in
terms of E and the phase $\tau$, and then averaging the resulting expression
over $\tau$. To illustrate, use the results of Exercise to verify the averages
computed in Exercises 8 and 9. What condition must the dynamical
variable s satisfy in order that its distribution be stationary when E and
nothing else is known?

13. We instruct a person to toss a well-balanced coin and to adjust the
energy of a classical oscillator ($V = \tfrac{1}{2}kx^2$) to be certainly $E_I$ if heads come
up, and to be certainly $E_{II}$, $E_{II} \ne E_I$, if tails do; in our absence he does
as instructed, and then returns the oscillator to us without comment.
Show that on the basis of this information: (a) the result of a contemplated
precise measurement of the energy of the oscillator at the time t is not a
certainty, and av $E = \tfrac{1}{2}(E_I + E_{II})$; (b) the result of a contemplated precise
measurement of the kth power of the energy at the time t is not a certainty,
and av $E^k = \tfrac{1}{2}(E_I^k + E_{II}^k)$; (c) the distribution-in-x of the oscillator at
the time t is $P(x) = \tfrac{1}{2}[P_I(x) + P_{II}(x)]$, where $P_I(x)$ and $P_{II}(x)$ are obtained
from (25) by using $E_I$ and $E_{II}$, respectively, for E.

14. A particle moves in a one-dimensional potential box with a slanting
bottom, shown in Fig. 17; its energy is certainly E, but the phase of its
motion is unknown. Show that if E is sufficiently large the particle is
more likely to be found in the shallow, rather than in the deep half of
the box, while for sufficiently small values of E the situation is reversed.
For what value of E is the particle as likely to be found in the deep as in
the shallow half?

Fig. 16. The distribution-in-p (26) for an oscillator (heavy lines).

Fig. 17. A potential box with a slanting bottom.

15. Show that, if P(s) is a continuous relative distribution and
$\int P(s)\,ds$ is finite, then

$$ \text{av } f(s) = \frac{\int f(s)P(s)\,ds}{\int P(s)\,ds}; \tag{27} $$

discuss the case when $\int P(s)\,ds$ is infinite. Consider the corresponding
cases for discrete relative distributions.

16. Explain how the distribution-in-f(s) can be deduced from the
distribution-in-s when the latter is discrete.

17. Show that, if the distribution-in-s is such that one of the possible
values of s, say $s_1$, is less than every other possible value,* then av $s > s_1$.


12. Probability Packets 

Classical mechanics imposes no limitation on the accuracy with 
which any set of dynamical variables pertaining to a given system 
can be simultaneously measured. Any particular apparatus used 
for the measurement of x, say, will of course have its own limit of
precision; but, if greater precision is required, another, more
delicate, apparatus can in principle be constructed. Classical
mechanics implies that the improvement of the apparatus for
measuring x can be carried on indefinitely, so that the precision
will exceed any preassigned standard, however fine. Similarly,
the precision of apparatus for the measurement of p, or of apparatus
for the simultaneous measurement of x and p, can, according
to the classical view, be improved without limit.

In practice, however, the experimental information regarding the
values of physical quantities is never absolutely precise; the
statement, for example, that the coordinate of a particle at the
time $t_0$ is $x_0$ usually means that at a time near the instant $t_0$ the
coordinate has a value near $x_0$, and that a value much different
from $x_0$ is unlikely to be the true value; such a statement is usually
accompanied by quoting some estimates of the reliability of $t_0$
and $x_0$, commonly their probable errors, which depend on the
technique used in the measurement. Having measured x with a


^ This condition is not satisfied, for example, when the possible values
of s form the infinite set 1, 1/2, 1/3, 1/4, ... .





certain apparatus, we obtain in fact a distribution function giving
the probability that an absolutely precise measurement of x
would have resulted in a value lying in dx at x.

In the text and in some of the exercises of §11 we investigated 
the results of experiments to be performed at a time t and designed 
to measure accurately the value of a dynamical variable pertaining 
to a system whose state is specified by its energy alone. We 
shall now continue to consider classical systems whose states are 
specified by information that is not equivalent to a precise knowl- 
edge of the initial conditions ; but, instead of assuming that the 
energy and nothing else is known, we shall turn to cases in which 
the information refers to the values of x and p at the time $t_0$, but
does not provide the initial conditions of the motion with absolute 
precision. 

An example. Let us consider the result of a precise measurement,
to be made at the time t, of the position of a free particle, the
available information being as follows: The momentum of the
particle at the instant t = 0 was precisely $p_0$; the position at t = 0
is not known with absolute precision, but the probability that the
particle was in dx at x at the instant t = 0 is $P^0(x)\,dx$, where

$$P^0(x) = \frac{1}{\Delta_x\sqrt{2\pi}}\, e^{-\frac{(x-x_0)^2}{2\Delta_x^2}}; \tag{1}$$

this information concerning x may be regarded as having been
obtained by means of an instrument having a resolving power of
the order of $\Delta_x$; the assumption that $P^0(x)$ is a normal distribution,
made principally for mathematical convenience, is perhaps
not unreasonable.

We are not able to compute the result of a precise measurement
of x at an instant t with certainty, and the best we can do is to
calculate the probability that the measurement will yield a result
lying in dx at x; we shall denote the distribution-in-x at the time t
by P(x). Now, our particle is moving with a uniform velocity
$v_0 = p_0/m$, so that it will be exactly at x = X when t = T if it was
exactly at $x = X - v_0T$ when t = 0, and it will be exactly at
X + dX when t = T if it was exactly at $X + dX - v_0T$ when
t = 0; the probability that it will be between X and X + dX at
the instant T thus equals the probability that it was between
$X - v_0T$ and $(X - v_0T) + dX$ when t = 0. According to (1),
the latter probability is $(\Delta_x\sqrt{2\pi})^{-1}\exp\{-[(X - v_0T) - x_0]^2/2\Delta_x^2\}\,dX$,
so that, writing x for X and t for T, we get

$$P(x) = \frac{1}{\Delta_x\sqrt{2\pi}}\, e^{-\frac{[x-(x_0+v_0t)]^2}{2\Delta_x^2}}. \tag{2}$$

Graphs of (2) for several equally spaced values of t, beginning
with t = 0, are indicated in Fig. 18, where $p_0$ is taken as positive.

It is often convenient to speak of the distribution function of a
dynamical variable as the probability packet describing this variable,
particularly if the distribution function vanishes or nearly vanishes
everywhere except in a limited region. The expected average of the
variable is then called the center of the packet. If this average varies
with the time, the packet is said to be moving. In our case, for
example, the position of the particle is described by the probability
packet (2), whose center lies at $x = x_0 + v_0t$ at the time t;
the center of this packet thus moves in precisely the way in which
our particle would move were its position and momentum
at t = 0 exactly $x_0$ and $p_0$.

Fig. 18. Successive graphs of the packet (2).

Since our particle is not being accelerated, the result of a precise 
measurement of its momentum at any time will be certainly po ; 
similarly, the result of a precise measurement of its energy at any 
time will be certainly $E = p_0^2/2m$.
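The rigid motion of the packet (2) can be checked numerically. The following is only a minimal sketch, assuming that (2) is the normal curve of fixed width $\Delta_x$ centered at $x_0 + (p_0/m)t$; the function names and the numerical values are illustrative choices, not part of the text.

```python
import math

def packet_first_example(x, t, x0, p0, m, delta_x):
    """Distribution-in-x of Eq. (2): a normal curve of fixed width delta_x
    whose center x0 + v0*t moves with the velocity v0 = p0/m."""
    center = x0 + (p0 / m) * t
    norm = 1.0 / (delta_x * math.sqrt(2.0 * math.pi))
    return norm * math.exp(-((x - center) ** 2) / (2.0 * delta_x ** 2))

def moments(density, lo, hi, steps=20000):
    """Return (total probability, av x, inexactitude of x) by the midpoint rule."""
    h = (hi - lo) / steps
    xs = [lo + (i + 0.5) * h for i in range(steps)]
    ws = [density(x) * h for x in xs]
    total = sum(ws)
    mean = sum(w * x for w, x in zip(ws, xs)) / total
    var = sum(w * (x - mean) ** 2 for w, x in zip(ws, xs)) / total
    return total, mean, math.sqrt(var)

# Illustrative values (assumptions): x0 = 1, p0 = 2, m = 1, delta_x = 0.5.
for t in (0.0, 1.0, 2.0):
    total, mean, width = moments(
        lambda x: packet_first_example(x, t, 1.0, 2.0, 1.0, 0.5), -20.0, 30.0)
    print(f"t={t}: total={total:.6f}, av x={mean:.6f}, width={width:.6f}")
```

The printed center advances by $v_0$ per unit time while the width stays equal to the initial $\Delta_x$, which is the statement that this packet moves without spreading.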



Second example. Let us consider next the result of a precise position
measurement, to be made on a free particle at the time t, if the available
information is that the position of the particle at t = 0 was precisely $x_0$,
while the momentum at t = 0 is not known with absolute precision, but
the probability that the momentum was in dp at p when t = 0 is dp times

$$P^0(p) = \frac{1}{\Delta_p\sqrt{2\pi}}\, e^{-\frac{(p-p_0)^2}{2\Delta_p^2}}. \tag{3}$$


Since our particle is not being accelerated, its momentum at any time
is the same as it was at t = 0, whatever its value at t = 0 may have been.
Now, the particle will be exactly at X when t = T if its momentum is
exactly $m(X - x_0)/T$, and it will be exactly at X + dX when t = T if its
momentum is exactly $m(X + dX - x_0)/T$, that is, $m(X - x_0)/T + m\,dX/T$;
the probability that the particle will be between X and X + dX when t = T
thus equals the probability that its momentum is between $m(X - x_0)/T$
and $m(X - x_0)/T + m\,dX/T$. According to (3), the latter probability is
$(\Delta_p\sqrt{2\pi})^{-1}\exp\{-[m(X - x_0)/T - p_0]^2/2\Delta_p^2\}\cdot m\,dX/T$, so that, writing
$v_0 = p_0/m$ and $\Delta_v = \Delta_p/m$, using x and t for X and T, and rearranging the
result, we get

$$P(x) = \frac{1}{t\Delta_v\sqrt{2\pi}}\, e^{-\frac{[x-(x_0+v_0t)]^2}{2t^2\Delta_v^2}}. \tag{4}$$


Comparison of (4) with 22¹¹ shows that (4) is a normal distribution,
av x at the time t being $x_0 + v_0t$, and $\Delta_x$ at the time t being $t\Delta_v$, where $\Delta_v$
is the constant inexactitude of the velocity. Thus the packet not only
moves with the constant velocity $v_0$, but also spreads out in the course
of time, so that our estimates concerning the outcome of a precise measurement
of x become less and less reliable as the instant at which the measurement
is to be made is postponed relatively to t = 0. Graphs of (4)
for several equally spaced values of t are given in Fig. 19, where $p_0$ is taken
as positive, and where the arrow indicates that the particle was certainly
at $x_0$ at t = 0.

Fig. 19. Successive graphs of the packet (4). The arrow indicates that
at t = 0 the particle was certainly at $x_0$.

The energy of our particle is, of course, not a certainty, because the
value of p is not certain; but the packet describing the energy, that is,
the distribution-in-E, is stationary, just as is the distribution-in-p.
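The spreading of the packet in the second example can be illustrated by sampling. The sketch below is only an illustration under stated assumptions: the momentum is drawn from a normal distribution with center $p_0$ and width $\Delta_p$, each sampled particle is moved freely from $x_0$, and the sample statistics are compared with the center $x_0 + v_0t$ and the width $t\Delta_v$. The function name, the seed and the numerical values are hypothetical.

```python
import random
import statistics

def sample_positions(t, x0, p0, m, delta_p, n, seed=0):
    """Sample x(t) = x0 + p*t/m with p drawn from a normal distribution
    of center p0 and width delta_p (the form assumed for Eq. (3))."""
    rng = random.Random(seed)
    return [x0 + rng.gauss(p0, delta_p) * t / m for _ in range(n)]

# Illustrative values (assumptions): x0 = 0, p0 = 1, m = 2, delta_p = 0.4.
x0, p0, m, delta_p = 0.0, 1.0, 2.0, 0.4
delta_v = delta_p / m
for t in (1.0, 2.0, 4.0):
    xs = sample_positions(t, x0, p0, m, delta_p, n=20000)
    print(f"t={t}: sample av x={statistics.fmean(xs):.3f} "
          f"(expected {x0 + p0 * t / m:.3f}), "
          f"sample width={statistics.pstdev(xs):.3f} "
          f"(expected {t * delta_v:.3f})")
```

Because the estimates come from a finite sample, they agree with the expected values only to within statistical fluctuations.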

Third example. We shall finally consider the case when the available
information regarding a free particle is that neither its position nor its
momentum at t = 0 is known with absolute precision, but that the
distribution-in-x and the distribution-in-p at t = 0 are, respectively,

$$P_x^0(x) = \frac{1}{\Delta_x\sqrt{2\pi}}\, e^{-\frac{(x-x_0)^2}{2\Delta_x^2}}, \tag{5}$$

$$P_p^0(p) = \frac{1}{\Delta_p\sqrt{2\pi}}\, e^{-\frac{(p-p_0)^2}{2\Delta_p^2}}. \tag{6}$$

The distributions (5) and (6) are to be regarded as independent, so that
the probability that x had a value in dx at x at t = 0 and that at the same
instant p had a value in dp at p is $P_x^0(x)P_p^0(p)\,dx\,dp$. The simultaneous
distribution-in-x-and-p at t = 0, given by $P_x^0(x)P_p^0(p)$, may be pictured
(see Fig. 20) as a surface lying above the xp-plane. Note that we may
legitimately speak of the simultaneous distribution of x and p only if no
limitations are put on the precision with which x and p can be measured
simultaneously.

Now, our particle will be certainly at X when t = T, if its position x
and momentum p at t = 0 satisfy the relation

$$x + pT/m = X; \tag{7}$$

Equation (7) may be interpreted as specifying a straight line in the
xp-plane of Fig. 20. Again, the particle will be certainly at X + dX when
t = T, if its position and momentum at t = 0 satisfy the relation

$$x + pT/m = X + dX, \tag{8}$$

specifying another straight line in the xp-plane. The probability, P(X) dX,
that the particle be between X and X + dX when t = T is thus
equal to the probability that the point in the xp-plane corresponding
to the true (but unknown) initial conditions will lie between the
two parallel straight lines (7) and (8). This latter probability, in
turn, equals the volume bounded at the top by the surface
$P_x^0(x)P_p^0(p)$, at the bottom by the xp-plane, and at the sides by the
two vertical planes cutting the xp-plane along the lines (7) and (8);
that is,

$$P(X)\,dX = \int_{-\infty}^{\infty}\int_{m(X-x)/T}^{m(X-x)/T + m\,dX/T} P_x^0(x)P_p^0(p)\,dp\,dx. \tag{9}$$

Fig. 20. The simultaneous distribution of x and p for the case given
by (5) and (6). The horizontal plane is the xp-plane.

Now, if ε is sufficiently small, then

$$\int_a^{a+\epsilon} f(s)\,ds = \epsilon f(a), \tag{10}$$

so that (9) becomes

$$P(X)\,dX = \frac{m\,dX}{T}\int_{-\infty}^{\infty} P_x^0(x)P_p^0[m(X-x)/T]\,dx = \frac{m\,dX}{2\pi T\Delta_x\Delta_p}\int_{-\infty}^{\infty}\exp\left\{-\frac{(x-x_0)^2}{2\Delta_x^2}\right\}\exp\left\{-\frac{[m(X-x)/T-p_0]^2}{2\Delta_p^2}\right\}dx. \tag{11}$$




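If a, b, c, d, f, α, and β are independent of s, we have the standard formula

$$\int_{-\infty}^{\infty}\exp\left\{-\frac{(s-a)^2}{2\alpha^2}\right\}\exp\left\{-\frac{[b(f-cs)-d]^2}{2\beta^2}\right\}ds = \frac{\sqrt{2\pi}\,\alpha\beta}{\sqrt{b^2c^2\alpha^2+\beta^2}}\exp\left\{-\frac{(f-ac-d/b)^2}{2(c^2\alpha^2+\beta^2/b^2)}\right\}. \tag{12}$$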
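A formula of this kind is easy to test by direct quadrature. The sketch below assumes that (12) reads: the integral over all s of $\exp\{-(s-a)^2/2\alpha^2\}\exp\{-[b(f-cs)-d]^2/2\beta^2\}$ equals $\sqrt{2\pi}\,\alpha\beta\,(b^2c^2\alpha^2+\beta^2)^{-1/2}\exp\{-(f-ac-d/b)^2/2(c^2\alpha^2+\beta^2/b^2)\}$, with b, c, α and β positive. This reading, the function names and the numerical values are assumptions made for the illustration.

```python
import math

def lhs_quadrature(a, b, c, d, f, alpha, beta, lo=-30.0, hi=30.0, steps=60000):
    """Midpoint-rule value of the integral on the left side of (12)."""
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps):
        s = lo + (i + 0.5) * h
        total += math.exp(-((s - a) ** 2) / (2 * alpha ** 2)
                          - ((b * (f - c * s) - d) ** 2) / (2 * beta ** 2))
    return total * h

def rhs_closed_form(a, b, c, d, f, alpha, beta):
    """Closed form on the right side of (12) as read above
    (b, c, alpha, beta assumed positive)."""
    spread = c ** 2 * alpha ** 2 + beta ** 2 / b ** 2
    prefactor = (math.sqrt(2 * math.pi) * alpha * beta
                 / math.sqrt(b ** 2 * c ** 2 * alpha ** 2 + beta ** 2))
    return prefactor * math.exp(-((f - a * c - d / b) ** 2) / (2 * spread))

# Arbitrary illustrative parameters (assumptions).
params = dict(a=0.3, b=1.5, c=0.8, d=0.2, f=1.1, alpha=0.7, beta=0.9)
print(lhs_quadrature(**params), rhs_closed_form(**params))
```

With s = x, a = $x_0$, α = $\Delta_x$, b = m/T, c = 1, f = X, d = $p_0$ and β = $\Delta_p$, the integrand is that of (11).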


The integral in (11) has the form (12), and, after writing x and t for X and T
in the result of the integration, we finally get

$$P(x) = \frac{1}{\sqrt{\Delta_x^2+t^2\Delta_v^2}\,\sqrt{2\pi}}\, e^{-\frac{[x-(x_0+v_0t)]^2}{2(\Delta_x^2+t^2\Delta_v^2)}}. \tag{13}$$


The x-packet (that is, the packet describing x) is therefore again a
normal distribution, av x at the time t being $x_0 + v_0t$ and the inexactitude
of x at the time t being $\sqrt{\Delta_x^2 + t^2\Delta_v^2}$. The x-packet thus moves (see Fig. 21)
with a constant velocity and spreads at the same time. The
p-packet (that is, the packet describing p) is at all times the
stationary packet (6). The E-packet is of course also stationary.

Fig. 21. Successive graphs of the packet (13).
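The three examples can be compared through the width of the x-packet alone. The sketch below assumes that the inexactitude of x in (13) is $\sqrt{\Delta_x^2 + t^2\Delta_v^2}$ and checks that it reduces to the constant $\Delta_x$ of the first example when $\Delta_v = 0$ and to $t\Delta_v$ of the second example when $\Delta_x = 0$; the function names and numbers are illustrative only.

```python
import math

def inexactitude_of_x(t, delta_x, delta_v):
    """Width of the x-packet at time t, as read from Eq. (13)."""
    return math.sqrt(delta_x ** 2 + (t * delta_v) ** 2)

def packet_third_example(x, t, x0, v0, delta_x, delta_v):
    """Normal distribution-in-x centered at x0 + v0*t with the width above."""
    width = inexactitude_of_x(t, delta_x, delta_v)
    return (math.exp(-((x - (x0 + v0 * t)) ** 2) / (2 * width ** 2))
            / (width * math.sqrt(2 * math.pi)))

# Illustrative values (assumptions): delta_x = 0.5, delta_v = 0.2.
for t in (0.0, 1.0, 5.0, 25.0):
    print(t,
          inexactitude_of_x(t, 0.5, 0.0),   # first example: no spreading
          inexactitude_of_x(t, 0.0, 0.2),   # second example: width t*delta_v
          inexactitude_of_x(t, 0.5, 0.2))   # third example: both combined
```

For large t the combined width approaches $t\Delta_v$, so the initial uncertainty in position eventually matters less than the uncertainty in velocity.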


The δ-function. In conclusion, let us return to Fig. 19,
where we used an arrow to indicate that at t = 0 the particle
is certainly at $x_0$, and consider what function of x might
be used for the distribution-in-x when x is certainly equal to $x_0$.
When the particle is certainly at $x_0$ the probability of finding it in
any region not including $x_0$ is zero, so that the required function,
which we shall denote by $\delta(x - x_0)$, should satisfy the relation

$$\delta(x - x_0) = 0 \quad \text{for } x \neq x_0. \tag{14}$$

Further, if the distribution is to be absolute, we must have

$$\int_{-\infty}^{\infty}\delta(x - x_0)\,dx = 1. \tag{15}$$

Now, (14) and (15) are not consistent if the integral in (15) is
thought of in the usual sense, because, if a function differs from
zero at only one point, its integral vanishes when interpreted in the
usual way. We shall, nevertheless, employ the function $\delta(x - x_0)$,
defined by (14) and (15), with the understanding that the results





obtained by using it can be verified by recourse to appropriate
limiting processes, and without reference to a function satisfying
both (14) and (15). In quantum mechanics the function $\delta(x - x_0)$
(the δ-function) was first used by Dirac.

To visualize $\delta(x - x_0)$ we may return to Equation (4) and allow
t to decrease toward zero. The numerator of the exponent in (4)
will approach $(x - x_0)^2$ when t → 0, so that, writing ε for $t\Delta_v\sqrt{2}$,
we get

$$\lim_{\epsilon\to 0}\frac{1}{\epsilon\sqrt{\pi}}\, e^{-\frac{(x-x_0)^2}{\epsilon^2}} \tag{16}$$

for $\delta(x - x_0)$. The definition of $\delta(x - x_0)$ is, however, (14) and
(15) rather than® (16). At $x = x_0$, $\delta(x - x_0)$ is of course infinite.

Our interest in the δ-function is due chiefly to the fact that in
view of (14)

$$x\,\delta(x - x_0) = x_0\,\delta(x - x_0), \tag{17}$$

so that $\delta(x - x_0)$ satisfies the eigenvalue equation 6® for the operator
x. According to (16), $\delta(x - x_0)$ can be interpreted as a limit
of a well-behaved function; but $\delta(x - x_0)$ itself is not well-behaved
in the sense of §3. We shall nevertheless adopt $\delta(x - x_0)$ as an
eigenfunction of the operator x belonging to the eigenvalue $x_0$ of x,
even though this step will undermine the mathematical rigor of a
few of our computations. It then follows that any real number
(positive, negative, or zero) is an eigenvalue of the operator x,
and that x has no complex or pure imaginary eigenvalues.
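The limiting process behind (16) can be watched numerically. The sketch below assumes that the function under the limit in (16) is the normal curve $(\epsilon\sqrt{\pi})^{-1}\exp\{-(x-x_0)^2/\epsilon^2\}$, and integrates it against a smooth test function for decreasing ε; the test function (a cosine), the function name and the numerical values are illustrative choices.

```python
import math

def smeared_average(f, x0, eps, half_width=12.0, steps=4000):
    """Midpoint-rule value of the integral of f(x) times the normal curve
    exp(-(x - x0)**2 / eps**2) / (eps*sqrt(pi)), taken over
    x0 - half_width*eps <= x <= x0 + half_width*eps."""
    lo = x0 - half_width * eps
    h = 2.0 * half_width * eps / steps
    total = 0.0
    for i in range(steps):
        x = lo + (i + 0.5) * h
        total += f(x) * math.exp(-((x - x0) / eps) ** 2)
    return total * h / (eps * math.sqrt(math.pi))

x0 = 0.7  # illustrative point (assumption)
for eps in (1.0, 0.3, 0.1, 0.01):
    print(eps, smeared_average(math.cos, x0, eps), math.cos(x0))
```

The curve keeps unit area for every ε while concentrating at $x_0$, so the smeared average of the test function tends to its value at $x_0$. For the cosine used here the integral is known in closed form, $\cos x_0\, e^{-\epsilon^2/4}$, which gives an independent check of the quadrature.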


Exercises 

1. Suppose that at t = 0 the distribution-in-x and the distribution-in-p
of the particle comprising a linear harmonic oscillator are (5) and (6),
respectively, and show that then the x-packet and the p-packet at the instant t
are

$$P(x) = \frac{1}{\sqrt{\Delta_x^2\cos^2\omega t + \Delta_p^2 m^{-2}\omega^{-2}\sin^2\omega t}\,\sqrt{2\pi}}\exp\left\{-\frac{[x-(x_0\cos\omega t + p_0m^{-1}\omega^{-1}\sin\omega t)]^2}{2(\Delta_x^2\cos^2\omega t + \Delta_p^2m^{-2}\omega^{-2}\sin^2\omega t)}\right\}, \tag{18}$$


® Note, for example, the equation following Equation (643) of J. H.
Jeans's The Mathematical Theory of Electricity and Magnetism (hereafter
cited as Jeans), Fifth Edition. Cambridge: The University Press, 1925.





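$$P(p) = \frac{1}{\sqrt{\Delta_x^2m^2\omega^2\sin^2\omega t + \Delta_p^2\cos^2\omega t}\,\sqrt{2\pi}}\exp\left\{-\frac{[p-(-x_0\omega m\sin\omega t + p_0\cos\omega t)]^2}{2(\Delta_x^2m^2\omega^2\sin^2\omega t + \Delta_p^2\cos^2\omega t)}\right\}. \tag{19}$$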


Consider the motions of the centers of the two packets in the light of 
12»® and 13'“, and note the periodic changes in the shapes of the packets. 

2. Show that, for the oscillator of Exercise 1,
av $E = (p_0^2 + \Delta_p^2)/2m + k(x_0^2 + \Delta_x^2)/2$.

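3. Note that, if $\Delta_p^2 = mk\Delta_x^2$, then the results of Exercises 1 and 2
reduce to

$$P(x) = \frac{1}{\Delta_x\sqrt{2\pi}}\exp\left\{-\frac{[x-(x_0\cos\omega t + p_0m^{-1}\omega^{-1}\sin\omega t)]^2}{2\Delta_x^2}\right\}, \tag{20}$$

$$P(p) = \frac{1}{m\omega\Delta_x\sqrt{2\pi}}\exp\left\{-\frac{[p-(-x_0\omega m\sin\omega t + p_0\cos\omega t)]^2}{2m^2\omega^2\Delta_x^2}\right\}, \tag{21}$$

$$\text{av}\, E = \frac{1}{2m}p_0^2 + \tfrac12 kx_0^2 + k\Delta_x^2, \tag{22}$$

so that in this special case the packets move without change in shape.
Illustrate the motions of the packets (20) and (21) by diagrams.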

4. Show that, if $\Delta_p^2 = mk\Delta_x^2$ and $x_0 = p_0 = 0$, then the distribution-in-E
of the oscillator of the preceding exercises is

$$P(E) = \frac{1}{k\Delta_x^2}\, e^{-E/k\Delta_x^2}, \qquad E \geq 0, \tag{23}$$

av $E = k\Delta_x^2$, and the most probable value of E is zero. Note that in
this case the probability packets for x and p are both stationary,
even though the energy is not a certainty.

5. Show that in the case of a linear harmonic oscillator the probability
packets vary periodically with the period $2\pi\sqrt{m/k}$, regardless
of the form that they may have initially.

6. Discuss the physical meaning of the various probability
packets of this section for times prior to t = 0.

7. With the help of the formulas

$$\int_{-\infty}^{\infty} e^{-s^2}\,ds = \sqrt{\pi}, \tag{24a}$$

$$\int_{-\infty}^{\infty} s^{2n}e^{-s^2}\,ds = \frac{1\cdot 3\cdot 5\cdots(2n-1)}{2^n}\sqrt{\pi}, \tag{24b}$$

where n is a positive integer, show that, if f(x) can be expanded into a
power series in x in the vicinity of $x_0$, then

$$\lim_{\epsilon\to 0}\int_{-\infty}^{\infty} f(x)\left[\frac{1}{\epsilon\sqrt{\pi}}\, e^{-\frac{(x-x_0)^2}{\epsilon^2}}\right]dx = f(x_0). \tag{25}$$

Equation (25) can be interpreted as meaning that if a normal
distribution-in-x is varied in such a way as to confine the particle to a greater and
greater extent to the immediate neighborhood of $x_0$, then the expected
average of f(x) approaches $f(x_0)$, as of course it should.

8. Show that in view of (14) and (15)

$$\int_{-\infty}^{\infty} f(x)\,\delta(x - x_0)\,dx = f(x_0); \tag{26}$$

one interpretation of this equation is that, if the distribution-in-x is
$\delta(x - x_0)$, that is, if the value of x is certainly $x_0$, then the expected average
of f(x) is $f(x_0)$. Comparison of (26) with (25) illustrates that a result
obtained by means of the δ-function can also be obtained without its use
if appropriate limiting processes are employed.

9. Discrete distributions, which we have so far presented in the form
of tables, can be expressed in terms of δ-functions, and then treated formally
as continuous ones. Consider the 'continuous' distribution-in-p

$$P(p) = \tfrac12\delta(p + \sqrt{2mE}) + \tfrac12\delta(p - \sqrt{2mE}) \tag{27}$$


and show [by computing av p*, using (27) and 16“, and comparing the 
result with av p* obtained from Table 1 by means of 8“] that (27) is equiva- 
lent to the discrete distribution-in-p of Table 1. Note that the physical 
dimensions of the right side of (27) are just those required for a continuous 
absolute distribution-in-p. 



CHAPTER III


ELEMENTS OF THE SCHROEDINGER METHOD; 

ONE DIMENSION 

In §13 we shall state two quantum-mechanical assumptions
which are quite general in that they do not refer specifically to
any one of the several distinct mathematical methods that the
theory may use. In §14 we shall turn to the Schroedinger
method (as it appears in the light of developments that followed
its inception) and, disregarding historical order, shall base this
method in part on the assumptions of §13. Incidentally, the
Schroedinger method is sometimes called the use of the
'Schroedinger representation.'

13. Two Assumptions 

The postulates of any mathematical theory of physical phenomena
(for example, Newton's laws of classical mechanics, or the
fundamental assumptions of quantum mechanics) are a set of
categorical statements that the student is asked to take for
granted. Back of such postulates there is a long story of search,
trial, and achievement, a story both instructive and inspiring.
But, once a set of postulates is formulated, its worth is to be
judged solely by its effectiveness in leading to theoretical results
that are in agreement with experimental facts, and by the
generality of its applicability.

How easily the postulates of a theory can be made to appear 
intelligible and reasonable to the student depends in a large 
measure on the readiness with which physical conclusions regarding 
some familiar phenomena can be derived from the postulates. 
Now, Newton's laws can be made to appear sensible by a quite 
elementary discussion of the student’s experiences in daily life. 
But in the case of quantum mechanics the deduction of the 
properties of large-scale phenomena encountered in daily life 
involves much calculation, while the results obtained most easily 
have to do with atomic phenomena, with which none of us is 
acquainted through intimate personal contact. Consequently we 



cannot expect that the physical significance of the postulates given 
below will be clear as soon as they are stated; but we hope that 
as the reader studies their use in answering specific physical ques- 
tions about specific dynamical systems, he will gradually perceive 
their physical content. 

A rigorous and complete statement of the fundamental rules 
and assumptions of quantum mechanics is of necessity quite math- 
ematical and abstract, and is therefore not the best starting 
point for a beginner. The rules and assumptions which we shall 
state and which will take the reader a long way toward under- 
standing the quantum-mechanical method are easy to memorize, 
and, in almost every case, each of them can be put to use before 
the next is introduced; but not all of them are free from defects. 
For example, in the second sentence of our first assumption we 
use the term 'precise experimental measurement' without a word
of explanation, although the subject of quantum-mechanical 
measurement requires pages of advanced discussion;^ we fail to 
mention certain relevant matters, such as the radiative effects 
hinted at in §31; besides, the assumption made in this sentence is 
eventually shown to be contained in another assumption made 
later. 

Our introductory assumption, which we shall call the operator 
assumption, is as follows: 

I. The physical properties of any dynamical variable pertaining 
to a given dynamical system can be inferred theoretically from
the mathematical properties of an operator to be associated with 
this dynamical variable. In particular, the possible numerical 
results of a precise experimental measurement of a dynamical 
variable are the eigenvalues of an operator associated with this 
dynamical variable, and vice versa. 

With every dynamical variable we thus associate in quantum
mechanics an operator. A dynamical variable and an operator
associated with it will be denoted by the same letter, it being left
for the reader to determine from the context whether a symbol,
say α, stands for the dynamical variable α or for an operator α
associated with the dynamical variable α. To minimize the
^ E. C. Kemble, The Fundamental Principles of Quantum Mechanics
[hereafter cited as Kemble], Chapter IX. New York: McGraw-Hill
Book Co., 1937.





possibility of confusion, we shall (until Chapter XIII) usually
employ the following convention: if an expression (or equation)
contains a symbol that is to be interpreted as an operator associated
with a dynamical variable, rather than as the dynamical
variable itself, then the marginal number of the expression (or
equation) is enclosed in angular brackets rather than in parentheses.

Although the operator assumption does not provide a method
for associating operators with dynamical variables, it does require
that operators to be associated with real dynamical variables must
have only real eigenvalues; it also suggests that the operators to
be associated with numerical constants, such as 0, 1, 2, π, m, and
so forth, are the corresponding operators of type c, that is, the
operators 0, 1, 2, π, m, and so forth. A detailed program for
associating operators with dynamical variables not involving t
explicitly is provided by the following rule® (the symbol ℏ, introduced
by Dirac, stands for h/2π, where h is Planck's constant,
equal to approximately $6.6 \times 10^{-27}$ erg-seconds; as usual,
$i = \sqrt{-1}$):


The operators α and β, to be associated, respectively, with
the dynamical variables α and β, must be so chosen that

$$\alpha\beta - \beta\alpha = i\hbar[\alpha, \beta], \tag*{$\langle 1\rangle$}$$

where [α, β] is the operator to be associated with the classically
computed Poisson bracket of α and β.

Equation ⟨1⟩, through which Planck's constant is introduced
into the theory, is called the quantum condition. The appearance
of [α, β] implies that the dynamical system to be studied
is described classically in terms of some coordinate system, and,
unless the contrary is explicitly stated, we shall take these
coordinates to be Cartesian (rectangular) coordinates.

Equation ⟨1⟩ involves the constant iℏ and three operators: that
to be associated with the classical dynamical variable α, that to be
associated with the classical dynamical variable β, and that to be
associated with the classical dynamical variable [α, β]. The
structure of ⟨1⟩ depends, therefore, on the form of [α, β], and is
particularly simple if the Poisson bracket is a constant, because
then the latter can be immediately associated with an operator
of type c, and the number of operators involved in ⟨1⟩ is essentially
reduced to two.

[…] of J. H. Van Vleck in his Theory of Electric and Magnetic
Susceptibilities. Oxford: The Clarendon Press, 1932.

P. A. M. Dirac, The Principles of Quantum Mechanics, Second Edition
[hereafter cited as Dirac], page 90. Oxford: The Clarendon Press, 1935.

To illustrate the implications of ⟨1⟩, let us consider the operators
(which we denote* by x and p) to be associated, respectively,
with the Cartesian coordinate x and the momentum p of a particle
undergoing one-dimensional motion. In this case, ⟨1⟩ becomes

$$xp - px = i\hbar[x, p]. \tag*{$\langle 2\rangle$}$$

Now, classically,

(12*)
$$[x, p] = 1, \tag{3}$$

so that we can write the operator 1 for the operator [x, p] on the
right side of ⟨2⟩, getting

$$xp - px = i\hbar 1, \tag*{$\langle 4\rangle$}$$

or simply,

$$xp - px = i\hbar. \tag*{$\langle 5\rangle$}$$

Consequently, the operators to be associated, respectively, with
the coordinate x and the conjugate momentum p must be so
chosen that their commutator is the constant iℏ. Incidentally,
the theory imposes no conditions on the operators to be associated
with x and p, apart from ⟨5⟩ and the requirement that the
respective eigenvalues of these operators be real.
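The classical Poisson brackets that enter the quantum condition can be evaluated numerically. The sketch below assumes, for one degree of freedom, the definition $[\alpha, \beta] = \frac{\partial\alpha}{\partial x}\frac{\partial\beta}{\partial p} - \frac{\partial\alpha}{\partial p}\frac{\partial\beta}{\partial x}$, which is consistent with [x, p] = 1 in (3); the central-difference helper and the sample point are illustrative choices, not part of the text.

```python
def poisson_bracket(alpha, beta, x, p, h=1e-4):
    """Central-difference estimate of the Poisson bracket [alpha, beta]
    at the point (x, p); alpha and beta are functions of (x, p)."""
    da_dx = (alpha(x + h, p) - alpha(x - h, p)) / (2 * h)
    da_dp = (alpha(x, p + h) - alpha(x, p - h)) / (2 * h)
    db_dx = (beta(x + h, p) - beta(x - h, p)) / (2 * h)
    db_dp = (beta(x, p + h) - beta(x, p - h)) / (2 * h)
    return da_dx * db_dp - da_dp * db_dx

def coordinate(x, p):
    return x

def momentum(x, p):
    return p

def x_squared(x, p):
    return x * x

# Sample point in the xp-plane (an arbitrary, illustrative choice).
x, p = 1.5, -0.7
print(poisson_bracket(coordinate, momentum, x, p))   # bracket of x and p
print(poisson_bracket(momentum, coordinate, x, p))   # reversed order changes the sign
print(poisson_bracket(x_squared, momentum, x, p))    # compare with 2*x
```

Since the bracket of x and p is the constant 1, the operator associated with it is an operator of type c, which is why ⟨2⟩ collapses to a statement about two operators only.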

In view of ⟨1⟩, the quantum conditions for the operators α, β,

* Heretofore we used the symbol x for the independent variable and 
for the operator x in mathematical computations, and also for the dy- 
namical variable x describing the position of a particle in physical prob- 
lems. In equations such as (2) the symbol x is used in a still different 
sense, namely, for an operator (whatever its specific form may be) to be 
associated in quantum theory with the dynamical variable x. These 
distinct uses of the same symbol should not be confused; it is of course 
possible to decrease the likelihood of a misunderstanding by employing 
several symbols instead of one, but the disadvantages of such a procedure, 
in the long run, outweigh its advantages, especially since but little care 
on the reader's part is required to avoid falling into error. 



and γ, to be associated, respectively, with three dynamical
variables α, β, and γ, which do not involve t explicitly, are

$$\alpha\beta - \beta\alpha = i\hbar[\alpha, \beta], \quad \beta\gamma - \gamma\beta = i\hbar[\beta, \gamma], \quad \gamma\alpha - \alpha\gamma = i\hbar[\gamma, \alpha], \tag*{$\langle 6\rangle$}$$

where [α, β], [β, γ], and [γ, α] are the operators to be associated
with the respective classically calculated Poisson brackets. The
case of more than three variables is similar. The reason for the
self-consistency of the large set of commutation rules that comes
into play when the number of variables under consideration is
great is that commutators of functions of operators satisfy relations
much like those satisfied by the Poisson brackets of functions
of dynamical variables (recall 29 ^ and 14 ®).

The quantum-mechanical method of associating operators with 
dynamical variables determines, through the quantum conditions, 
the commutation rules that the operators must satisfy, but does 
not specify in detail the individual operators themselves. This 
implies that detailed specification of individual operators is not 
necessary for the calculation of the theoretical data to be compared
with experiment, and that quantum-mechanical calculations
may be carried out by the purely symbolic methods of
operator algebra. But if a set of some specific operators is available
whose eigenvalues are real and whose commutation rules are
just those required by quantum mechanics, it is usually expedient
to associate the various dynamical variables with these specific
operators.

The method of associating variables and operators embodied 
in ⟨1⟩ applies to dynamical variables that are a part of the struc-
ture of classical mechanics. It appears, however, that dynamical 
variables exist in nature (those having to do with the so-called 
electron spin are an example) which defy classical description, or, 
as we shall say, have no classical analogues. For such variables 
the present method is insufficient. 

As implied in the operator assumption, the calculation of the
possible values of a dynamical variable† consists, in quantum
mechanics, in the computation of the eigenvalues of an operator
associated with this dynamical variable. Now, as illustrated in
Chapter I, the question of the eigenvalues of certain operators
may not arise unless the operands are restricted by some auxiliary
conditions, such as continuity and boundary conditions. But we
shall not make here any general assumption to take care of such
additional conditions, because they depend on the specific
mathematical procedure adopted in the solution of a quantum-mechanical
problem.

† From now on we shall, for brevity, say the possible values of a dynamical
variable instead of the possible numerical results of a precise experimental
measurement of a dynamical variable.
14. Schroedinger Operators

Quantum-mechanical calculations may be carried out either by 
purely symbolic methods of operator algebra based on the quantum
conditions 1¹³, or by associating the various dynamical
variables with operators of specific explicit forms. The latter 
alternative, which we shall consider first, leaves us a certain 
freedom of choice, and we shall begin with a particularly simple 
and useful scheme of association, the Schroedinger scheme. 

The operators associated in the Schroedinger method with dynamical
variables pertaining to one-dimensional systems are
specific functions of the operators ∂/∂x and x, and of operators
of type c, studied in Chapter I. The operands employed in the
Schroedinger theory of one-dimensional systems are, accordingly,
functions of x; they are usually denoted by the symbol ψ and are
called Schroedinger functions, wave functions, ψ-functions, or simply
ψ's. All Schroedinger functions are required to be well-behaved.
We shall not attempt here to give a comprehensive definition of
good behavior; for our elementary purposes it is sufficient to say
that one-dimensional Schroedinger functions are required to be
well-behaved in the sense of §3.

To begin the process of association, we turn to the funda- 
mental dynamical variables of one-dimensional motion, that is, 
the Cartesian coordinate x and the momentum p conjugate to it. 
In consequence of the quantum condition, the operators x and p, 
associated, respectively, with these dynamical variables, must, in 
any scheme of association, satisfy the relation 

(5¹³)
$$xp - px = i\hbar, \tag*{$\langle 1\rangle$}$$

and it remains to recognize a pair of specific operators, of the
kind studied in Chapter I, whose commutator is iℏ and whose
eigenvalues are all real. Now, since iℏ is a constant, it follows
from 30® that

$$x\left(-i\hbar\frac{\partial}{\partial x}\right) - \left(-i\hbar\frac{\partial}{\partial x}\right)x = i\hbar, \tag*{$\langle 2\rangle$}$$

so that the specific operators x and −iℏ∂/∂x have just the commutator
required by ⟨1⟩; further, the eigenvalues of each of those
operators are all real. Consequently, one of the methods (the
Schroedinger method) of associating specific operators with the
dynamical variables x and p is to associate the operator x with the
dynamical variable x, and the operator −iℏ∂/∂x with the dynamical
variable p. In symbols,

dynamical variable x → operator x
dynamical variable p → operator $-i\hbar\,\partial/\partial x$.   ⟨3⟩
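That the pair x and −iℏ∂/∂x has the required commutator can be checked by letting both operators act on a polynomial operand. The sketch below is only an illustration: polynomials are stored as lists of coefficients (lowest power first), ℏ is set to 1 by default, and all function names are hypothetical helpers rather than anything defined in the text.

```python
HBAR = 1.0  # units in which hbar = 1 (an assumption made for the illustration)

def apply_x(poly):
    """Multiply the polynomial by x: every coefficient moves up one power."""
    return [0j] + [complex(c) for c in poly]

def apply_p(poly, hbar=HBAR):
    """Apply -i*hbar*d/dx to the polynomial."""
    derived = [-1j * hbar * k * complex(c) for k, c in enumerate(poly)][1:]
    return derived or [0j]

def subtract(u, v):
    """Coefficient-wise difference u - v, padding the shorter list with zeros."""
    n = max(len(u), len(v))
    u = list(u) + [0j] * (n - len(u))
    v = list(v) + [0j] * (n - len(v))
    return [a - b for a, b in zip(u, v)]

def commutator_xp(poly, hbar=HBAR):
    """Result of (x p - p x) acting on the polynomial."""
    return subtract(apply_x(apply_p(poly, hbar)), apply_p(apply_x(poly), hbar))

operand = [1, 2, 3]            # the operand 1 + 2x + 3x^2
print(commutator_xp(operand))  # compare with i*hbar times the operand
```

The check uses polynomial operands only for convenience; the operator identity itself does not depend on that choice.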

We shall next indicate the operators associated in the Schroedinger
scheme with dynamical variables that are functions of x
and p, but do not involve explicitly the time t, and shall begin
with the dynamical variable $x^2$. The Poisson brackets of $x^2$
with x and with p are

$$[x^2, x] = 0, \qquad [x^2, p] = 2x, \tag{4}$$

so that, if the operator to be associated with $x^2$ be temporarily
denoted by α, the commutators of α with the operators already
introduced for x and p must, according to the quantum conditions,
be

$$\alpha x - x\alpha = 0, \qquad \alpha\left(-i\hbar\frac{\partial}{\partial x}\right) - \left(-i\hbar\frac{\partial}{\partial x}\right)\alpha = 2i\hbar x. \tag*{$\langle 5\rangle$}$$

Equations ⟨5⟩ are just the equations 37®, and their general solution
is

$$\alpha = x^2 + c, \tag*{$\langle 6\rangle$}$$

where c is an arbitrary constant. The physical assumptions introduced
so far do not assign a specific value to c, but we shall adopt
an additional rule: the operator to be associated with a
dynamical variable α(ξ) [that is, a dynamical variable which is
explicitly a function of the dynamical variable ξ only] is the operator
α(ξ), where ξ is the operator associated with the dynamical variable
ξ itself. This rule requires that we set c = 0 in ⟨6⟩, getting the
association

dynamical variable $x^2$ → operator $x^2$.   ⟨7⟩


The statement just made in italics requires that, having adopted 
the associations (3), we must also adopt the associations 

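dynamical variable $\alpha(x)$ $\leftrightarrow$ operator $\alpha(x)$   (8)

dynamical variable $\alpha(p)$ $\leftrightarrow$ operator $\alpha\left(-i\hbar\frac{\partial}{\partial x}\right)$   (9)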


and hence the association (7) can be set up without going through 
the steps (4) and (6) ; we used these steps to illustrate the possi- 
bility that the quantum conditions may leave an operator undeter- 
mined to the extent of an additive constant. 

An important dynamical variable is $p^2$; according to (9) the
operator associated with it in the Schroedinger scheme is the
square of the operator $-i\hbar\,\partial/\partial x$ associated with $p$ itself:

dynamical variable $p^2$ $\leftrightarrow$ operator $-\hbar^2\frac{\partial^2}{\partial x^2}$.   (10)


For a dynamical variable that is the sum of a function of x 
only and a function of p only, we adopt the association 


dynamical variable $\alpha(x) + \beta(p)$ $\leftrightarrow$ operator $\alpha(x) + \beta\left(-i\hbar\frac{\partial}{\partial x}\right)$,   (11)

of which all our previous associations are special cases. That
(11) is consistent with the quantum conditions follows from the
appropriateness of (3) and the similarity between 29^ and 14^.

The identification of a Schroedinger operator to be associated
with a dynamical variable of the form $\alpha(x)\beta(p)$ requires special
consideration. In the simple case of the variable $xp$ (which
classically can be denoted equally well by $px$) we may proceed as
follows: We note that $xp$ is the Poisson bracket of the variables
$\tfrac12 x^2$ and $\tfrac12 p^2$:

$xp = [\tfrac12 x^2, \tfrac12 p^2]$.   (12)

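The quantum condition then requires that the operator associated
with the dynamical variable $xp$ be $1/i\hbar$ times the commutator
of the operators already associated with $\tfrac12 x^2$ and $\tfrac12 p^2$,
that is, $-i\hbar^{-1}$ times the operator

$\tfrac12 x^2\left(-\tfrac12\hbar^2\frac{\partial^2}{\partial x^2}\right) - \left(-\tfrac12\hbar^2\frac{\partial^2}{\partial x^2}\right)\tfrac12 x^2 = \tfrac12\hbar^2\left(1 + 2x\frac{\partial}{\partial x}\right)$,   (13)

and hence

dynamical variable $xp$ (or $px$) $\leftrightarrow$ operator $\tfrac12\left[x\left(-i\hbar\frac{\partial}{\partial x}\right) + \left(-i\hbar\frac{\partial}{\partial x}\right)x\right]$.   (14)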

This operator is, so to speak, an average of the operators
$x(-i\hbar\,\partial/\partial x)$ and $(-i\hbar\,\partial/\partial x)x$, suggested, respectively, by the
classical expressions $xp$ and $px$.
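As a check on the algebra leading to (14), the averaged operator can be compared numerically with the closed form $-\tfrac12 i\hbar\,(1 + 2x\,\partial/\partial x)$, which is $-i\hbar^{-1}$ times the operator on the right side of (13). The sketch below is illustrative only: it assumes $\hbar = 1$, replaces $\partial/\partial x$ by a central difference, and uses an arbitrary smooth operand; all function names are hypothetical.

```python
import math

HBAR = 1.0  # illustrative units in which hbar = 1 (an assumption)

def d_dx(f, h=1e-5):
    """Central-difference approximation to df/dx."""
    return lambda x: (f(x + h) - f(x - h)) / (2 * h)

def p_op(f):
    """Return (-i*hbar d/dx) f."""
    df = d_dx(f)
    return lambda x: -1j * HBAR * df(x)

def x_op(f):
    """Return x * f."""
    return lambda x: x * f(x)

def averaged_xp(f):
    """(1/2)[x p + p x] f, the operator of equation (14)."""
    xp_f = x_op(p_op(f))
    px_f = p_op(x_op(f))
    return lambda x: 0.5 * (xp_f(x) + px_f(x))

def closed_form(f):
    """-(i*hbar/2)(1 + 2 x d/dx) f, obtained from equation (13)."""
    df = d_dx(f)
    return lambda x: -0.5j * HBAR * (f(x) + 2 * x * df(x))

def operand(x):
    """Arbitrary smooth test operand (hypothetical choice)."""
    return math.cos(x) * math.exp(-0.5 * x * x)

x0 = 0.4
difference = averaged_xp(operand)(x0) - closed_form(operand)(x0)
print(abs(difference))  # small: the two forms agree up to discretization error
```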

Dynamical variables of the form $\alpha(x)\beta(p)$, or sums of such
variables, will appear in our work only indirectly, and we shall
not consider them any further here. Also, we shall postpone
until later the proof that all the operators adopted above have
only real eigenvalues, a property which, in view of the operator
assumption I, is vital.

A dynamical variable of particular interest is the Hamiltonian,
given in the case of one-dimensional motion by

(6»)   $H = \frac{1}{2m}p^2 + V(x)$.   (15)

The operator, denoted by $H$, associated with the Hamiltonian in
the Schroedinger method is, in view of (8) and (10),

$H = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + V(x)$,   (16)

where the operator $V(x)$ is the same function of the operator $x$
as the potential $V(x)$ is of the coordinate $x$. When we introduce
the Schroedinger constant

$\kappa = \frac{8\pi^2 m}{h^2} = \frac{2m}{\hbar^2}$,   (17)

which we shall use throughout our work, Equation (16) becomes

$H = -\frac{1}{\kappa}\frac{\partial^2}{\partial x^2} + V(x)$.   (18)

In view of the operator assumption, the calculation of the
possible values of the energy of a dynamical system consists in
determining the eigenvalues of an operator to be associated with
the Hamiltonian of this system; these eigenvalues will be denoted
by $E$. In the Schroedinger scheme, the operator associated with
the Hamiltonian is (18), and the operands are required to be
well-behaved; hence the Schroedinger procedure for calculating the
possible results of a precise experimental measurement of the energy
of a conservative one-dimensional system described classically by the
potential function $V(x)$ is to determine those values of the number $E$
for which the equation

$H\psi = E\psi$   (19)

possesses well-behaved solutions, the operator $H$ in (19) being given
by (18).

Equation (19), with $H$ given by (18), is the one-dimensional
form of a fundamental equation of quantum mechanics, discovered
by Schroedinger® and called the first Schroedinger equation,
or simply the Schroedinger equation.

The one-dimensional first Schroedinger equation is often
written, in a more explicit form, as

$\frac{\partial^2\psi}{\partial x^2} + \kappa\,[E - V(x)]\,\psi = 0$.   (20)

Exercises 

1. Consider the multiplied-out form of the Schroedinger operator
associated with the dynamical variable $(Ax + Bp)^2$, where $A$ and $B$ are
constants, and note that it suggests the association (14).

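2. Examine the fitness of the following associations as alternatives
for (3): (a) dyn. var. $x$ $\leftrightarrow$ operator $ix$, and dyn. var. $p$ $\leftrightarrow$ operator $-\hbar\,\partial/\partial x$;
(b) dyn. var. $x$ $\leftrightarrow$ operator $i\hbar\,\partial/\partial x$, and dyn. var. $p$ $\leftrightarrow$ operator $x$; (c) dyn.
var. $x$ $\leftrightarrow$ operator $-i\hbar\,\partial/\partial x$, and dyn. var. $p$ $\leftrightarrow$ operator $x$.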

15. Energy Levels of an Oscillator

To illustrate the Schroedinger method of dealing with physical 
problems, we shall now consider a linear harmonic oscillator of 
mass m moving in the potential field 

®E. Schroedinger, Ann. der Physik, 79, 361, (1926). 



(14“)   $V(x) = \tfrac12 kx^2$,   (1)

and shall take up the question: What are the possible results of a 
precise experimental measurement of the energy of this oscillator? 

Incidentally, we would hardly ask this question in classical 
theory, because, when V{x) is given by (1), the energy of a 
classical oscillator can obviously have any positive value what- 
ever, zero included, depending on the initial conditions of the 
motion. But in quantum theory we shall draw a conclusion 
quite different from this. 

To answer our question, we must, according to the operator 
assumption, calculate the eigenvalues of the operator associated 
with the Hamiltonian of the oscillator; in view of $18^{14}$ and (1)
this operator is

$H = -\frac{1}{\kappa}\frac{\partial^2}{\partial x^2} + \tfrac12 kx^2$,   (2)

so that the Schroedinger equation, $H\psi = E\psi$, is

$-\frac{1}{\kappa}\frac{\partial^2\psi}{\partial x^2} + \tfrac12 kx^2\psi = E\psi$.   (3)

The fact that (3) is a partial differential equation implies that
$\psi$ may be a function of other independent variables besides
$x$; but, since $x$ is the only independent variable appearing
in (3) explicitly, the dependence of $\psi$ on $x$ is determined by the
equation

$-\frac{1}{\kappa}\frac{d^2\psi}{dx^2} + \tfrac12 kx^2\psi = E\psi$,   (4)

and $\psi$ can depend on variables other than $x$ only through the
arbitrary constants contained in the solutions of (4). Now, (4)
is of the same general form as the equation

$-\frac{d^2\psi}{ds^2} + s^2\psi = \lambda\psi$,   (5)

which possesses well-behaved solutions when, and only when,

(28^)   $\lambda = 2n + 1$,   $n = 0, 1, 2, 3, \ldots$,   (6)



and we shall transform (4) into (5) by a change of variable.
The substitution

$s = x/a$,   (7)

where $a$ is an as yet undetermined constant, carries (4) into

$-\frac{d^2\psi}{ds^2} + \tfrac12\kappa k a^4 s^2\psi = \kappa a^2 E\psi$.   (8)

We now choose

$a = \sqrt[4]{2/\kappa k} = \sqrt[4]{\hbar^2/mk}$   (9)

and set

$E = \lambda/a^2\kappa = \tfrac12\lambda\hbar\sqrt{k/m}$,   (10)

so that (8) goes over into

$-\frac{d^2\psi}{ds^2} + s^2\psi = \lambda\psi$,   (11)

that is, into an equation precisely like (5). The eigenvalues of
our $H$ can now be obtained by substituting into (10) the numbers
listed in (6):

$E = (n + \tfrac12)\hbar\sqrt{k/m}$,   $n = 0, 1, 2, 3, \ldots$.   (12)

The eigenvalues of the operator (2) associated with the Hamiltonian
of the oscillator are thus the numbers (12), and in accordance
with our operator assumption we conclude that the only
possible results of a precise experimental measurement of the energy
of a linear harmonic oscillator of mass $m$ and restoring constant $k$ are

$(n + \tfrac12)\hbar\sqrt{k/m}$,   $n = 0, 1, 2, 3, \ldots$.   (13)

By means of the abbreviation

(11“)   $\nu_c = \frac{1}{2\pi}\sqrt{k/m}$,   (14)

where the symbol $\nu_c$ is used to remind us that the right side of
(14) is just the frequency of a classical linear harmonic oscillator
of mass $m$ and restoring constant $k$, we may write (12) as

$E_n = (n + \tfrac12)h\nu_c$,   $n = 0, 1, 2, 3, \ldots$.   (15)
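The statement that the dimensionless equation $-d^2\psi/ds^2 + s^2\psi = \lambda\psi$ has well-behaved solutions only for $\lambda = 2n + 1$ can be illustrated numerically. The following is a rough sketch, not the method of the text: it assumes a simple shooting procedure (fourth-order Runge-Kutta integration outward from $s = 0$, followed by bisection on $\lambda$), and the function names, the cutoff `s_max`, and the bracketing intervals are choices made here for illustration. A trial $\lambda$ that is not an eigenvalue produces a solution that diverges at large $s$; the sign of the divergence flips as $\lambda$ passes through an eigenvalue.

```python
def tail(lam, parity, s_max=6.0, steps=3000):
    """Integrate psi'' = (s^2 - lam) psi from s = 0 to s_max with RK4.

    parity 0 starts an even solution (psi=1, psi'=0);
    parity 1 starts an odd solution (psi=0, psi'=1).
    Returns psi(s_max), whose sign shows how the solution diverges.
    """
    h = s_max / steps
    psi, dpsi = (1.0, 0.0) if parity == 0 else (0.0, 1.0)
    s = 0.0
    for _ in range(steps):
        sm, se = s + 0.5 * h, s + h
        k1p, k1d = dpsi, (s * s - lam) * psi
        k2p, k2d = dpsi + 0.5 * h * k1d, (sm * sm - lam) * (psi + 0.5 * h * k1p)
        k3p, k3d = dpsi + 0.5 * h * k2d, (sm * sm - lam) * (psi + 0.5 * h * k2p)
        k4p, k4d = dpsi + h * k3d, (se * se - lam) * (psi + h * k3p)
        psi += h / 6.0 * (k1p + 2 * k2p + 2 * k3p + k4p)
        dpsi += h / 6.0 * (k1d + 2 * k2d + 2 * k3d + k4d)
        s = se
    return psi

def eigenvalue(lo, hi, parity, tol=1e-7):
    """Bisect on lam until the sign change of tail() is located within tol."""
    f_lo = tail(lo, parity)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = tail(mid, parity)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Bracket the n-th eigenvalue between 2n and 2n + 2; parity alternates with n.
levels = [eigenvalue(2.0 * n, 2.0 * n + 2.0, n % 2) for n in range(4)]
print(levels)  # numerically close to 1, 3, 5, 7
```

Multiplying each $\lambda$ by $\tfrac12 h\nu_c$, as in (10), turns these numbers into the energies (15).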

The answer given by quantum mechanics to the question asked
at the beginning of this section is thus quite different from the
classical answer. Which is the more correct must be determined
by experiment; but we shall not discuss this point here.

When quantum mechanics implies that not all of the classically
permissible values of a dynamical variable can be results of a
precise experimental measurement of this variable, we say that
this dynamical variable is quantized; for example, we say that the
energy of a linear harmonic oscillator is quantized, for, instead
of all the positive values of the energy expected on the classical
theory, only those contained in (15) are the possible results of a
precise measurement. The possible values of the energy of a
system are called the energy levels of the system, and their totality
is called the energy spectrum, or the term spectrum, of the system.
The term spectrum of the linear harmonic oscillator is entirely
discrete, the successive energy levels being separated by the
amount $h\nu_c$; and the lowest possible value of the energy is not
zero, but

$E_0 = \tfrac12 h\nu_c$,   (16)

the so-called zero-point energy of the oscillator. The five lowest
energy levels of a linear harmonic oscillator are shown diagrammatically
in Fig. 22. Pure numbers, such as $n$ in (15), contained in
quantum-mechanical formulas for the possible values of a
dynamical variable are called quantum numbers. The Bohr theory
of the atom, the forerunner of quantum mechanics, gives
$E_n = nh\nu_c$, rather than (15), as the formula
for the energy levels of a linear harmonic oscillator.

Incidentally, quantum mechanics does 
not imply that the energy of every dynami- 
cal system is quantized; in this connection, 
the reader might at this point read the first 
two paragraphs of §28. 

Fig. 22. Energy-level diagram for a linear harmonic oscillator. The
thick lines show the five lowest levels given by (15).

Exercises

1. The pendulum of an ordinary clock executes approximately simple
harmonic motion. Discuss the feasibility of discriminating, by
experimenting with such a pendulum, between the classical and the
quantum-mechanical conclusions regarding the possible values of the
energy of a linear harmonic oscillator.

2. Show that a normalized eigenfunction of (2) belonging to the
eigenvalue $E_n$ given by (15) has the form $e^{i\gamma_n} N_n H_n(x/a)\exp[-\tfrac12 (x/a)^2]$, where
$\gamma_n$ is independent of $x$ but otherwise arbitrary, $H_n$ is the $n$th Hermite
polynomial 37^, $a$ is given by (9), and $N_n = 1/\sqrt{a\,2^n n!\,\pi^{1/2}}$.

3. Let $A$ be an arbitrary real constant, let $H$ be the operator (2), and
let $f$ be any function for which the integrals in (17) are finite. Show
that then

$\int \bar{f}\,(H - A)^2 f\,dx = \int |(H - A)f|^2\,dx$.   (17)

16. Schroedinger Functions 

The portion of quantum mechanics discussed so far and yielding 
the one-dimensional first Schroedinger equation enables us to 
calculate the energy levels, that is, the possible results of a precise 
measurement of the energy, for any conservative one-dimensional 
system; the procedure is quite direct: when confronted with a
system described by a specific potential $V(x)$, we seek mathematically
the eigenvalues of the operator $H = -\kappa^{-1}\,\partial^2/\partial x^2 + V(x)$.

The details of the computation may be very involved on account 
of mathematical complications; but in principle the procedure is 
straightforward. Consequently, were we interested exclusively in 
the energy levels of conservative one-dimensional systems, we 
would not need to introduce any quantum-mechanical assump- 
tions beyond those stated already. We shall not, however, 
proceed at once to systems other than the oscillator, but, in order 
to get a more detailed idea of the quantum-mechanical behavior 
of an oscillator, we shall first consider the information that the 
theory provides regarding results of experiments other than 
measurements of energy. To do this, we must examine more 
closely the operands of the Schroedinger scheme (that is, the 
Schroedinger functions that are involved only indirectly in the 
calculation of energy levels) and must augment our list of assump- 
tions. This will be done in this section and the next; and illustra- 
tions of the physical implications of our formal results in specific 
problems will be begun in §18. 

The only condition imposed so far on the Schroedinger operands
is the requirement of good behavior. We now add another, which
we denote by $\mathrm{III}^S$, rather than by III, to indicate that it refers
specifically to the Schroedinger method.



$\mathrm{III}^S$. Every Schroedinger operand $\psi$ pertaining to a given
dynamical system depends on the time $t$ in such a way that

$H\psi = i\hbar\,\frac{\partial\psi}{\partial t}$,   (1)

where $H$ is the Schroedinger operator associated with the Hamiltonian
of this system.

The partial differential equation (1) is called the second Schroedinger
equation. It is to be understood from now on that (with
a minor exception to be mentioned in a moment) all Schroedinger
functions depend on $t$, and that the symbol $\psi$, by which we shall
continue to denote them, is, in the case of one-dimensional systems,
an abbreviation for $\psi(x, t)$.

To illustrate: In the case of a linear harmonic oscillator, $H$ is given
by $2^{15}$, and (1) becomes

$-\frac{1}{\kappa}\frac{\partial^2\psi}{\partial x^2} + \tfrac12 kx^2\psi = i\hbar\,\frac{\partial\psi}{\partial t}$,   (2)

so that, when the Schroedinger method is used for the study of a linear
harmonic oscillator, then every operand, besides being a well-behaved
function of $x$, must depend on $t$ so as to satisfy the partial differential
equation (2).

We shall now outline a procedure for computing the Schroedinger
function $\psi = \psi(x, t)$, belonging to a one-dimensional
system with a Hamiltonian $H$, for times other than $t = 0$, if it is
known to be $\psi^0 = \psi(x, 0)$ at $t = 0$. The desired function must
satisfy (1) and must reduce to $\psi^0$ at $t = 0$; since (1) is a differential
equation of the first order in $\partial/\partial t$, the problem has a unique
solution.

We consider first the case when $\psi^0$ is an eigenfunction of the
operator $H$, so that, denoting it by $\psi_E^0$, we have

$H\psi_E^0 = E\psi_E^0$,   (3)

where $E$ is a constant. Now, the function

$\psi_E = \psi_E^0\,e^{-iEt/\hbar}$   (4)

satisfies (1), since

$H\psi_E = e^{-iEt/\hbar}H\psi_E^0 = E\psi_E$   (5)

and

$i\hbar\,\frac{\partial\psi_E}{\partial t} = i\hbar\,\psi_E^0\,(-iE/\hbar)\,e^{-iEt/\hbar} = E\psi_E$;   (6)

further, (4) reduces to $\psi_E^0$ at $t = 0$. Consequently, our first
problem has been solved: an eigenfunction of $H$ belonging to the
eigenvalue $E$ depends on the time through the factor $e^{-iEt/\hbar}$, a
factor whose formal appearance is the same for all dynamical
systems. As shown by (5), in the case of a conservative system,
a Schroedinger function which, at $t = 0$, is an eigenfunction of $H$
belonging to the eigenvalue $E$ will always remain an eigenfunction
of $H$ belonging to the same eigenvalue. Again, we have
$\bar\psi_E\psi_E = \bar\psi_E^0\psi_E^0$, so that an eigenfunction of $H$, if normalized at $t = 0$, will
always remain normalized. Note that, if $E$ in (4) should equal
zero, we get an exception to our statement that all Schroedinger
functions depend on $t$; but this exception is too unimportant to
emphasize.

We can now write down the explicit expressions for those of the
normalized Schroedinger functions, pertaining to the oscillator, which are
eigenfunctions of the operator $H$ of the oscillator. In view of the results
of this section and of Exercise $2^{15}$ we have:

$\psi_n = e^{i\gamma_n} N_n H_n(x/a)\,e^{-\frac12 (x/a)^2}\,e^{-iE_n t/\hbar}$,   $n = 0, 1, 2, \ldots$,   (7)

where

$a = \sqrt[4]{2/\kappa k}$,   (8)

$E_n = (n + \tfrac12)h\nu_c$,   $N_n = 1/\sqrt{a\,2^n n!\,\pi^{1/2}}$,   (9)

$H_n$ is the Hermite polynomial 37*, and the $\gamma$'s are real and independent
of both $x$ and $t$, but otherwise arbitrary. The results of Exercise 5® show
that the functions (7) form, at any instant $t$, an orthonormal set, complete
with respect to functions that are of integrable square in the infinite
interval and satisfy certain mild continuity conditions.

In describing a method for deducing $\psi = \psi(x, t)$ from $\psi^0 = \psi(x, 0)$
when $\psi^0$ is not an eigenfunction of $H$, we shall restrict
ourselves to the case of the oscillator; that is, we shall look for
that function $\psi$ which satisfies (2) and reduces to $\psi^0$ for $t = 0$.
The method employs the $\psi$-functions (7), which have the form

$\psi_n = \psi_n^0\,e^{-iE_n t/\hbar}$,   $n = 0, 1, 2, \ldots$,   (10)

and satisfy the equations

$H\psi_n = E_n\psi_n$,   (11)



where $H$ is the operator associated with the Hamiltonian of the
oscillator. We assume that the arbitrary phases of the functions
(7) have been fixed by some suitable convention. Letting the
superscript $^0$ denote that $t = 0$, we write

$\psi_n^0 = \psi_n(x, 0)$,   $n = 0, 1, 2, \ldots$.   (12)

We next expand the given function $\psi^0$ in terms of the orthonormal
set (12), getting

$\psi^0 = \sum_n c_n\psi_n^0$,   $n = 0, 1, 2, \ldots$,   (13)

the expansion coefficients being given by 21®, so that
$c_n = \int \bar\psi_n^0\,\psi^0\,dx$. Consider now the function

$\psi = \sum_n c_n\psi_n$,   $n = 0, 1, 2, \ldots$,   (14)

where the $c$'s are the same as in (13), and where the $\psi_n$ are the
time-dependent functions (10). We have

$H\psi = H\sum_n c_n\psi_n = \sum_n c_n E_n\psi_n$   (15)

and

$i\hbar\,\frac{\partial\psi}{\partial t} = i\hbar\sum_n c_n\frac{\partial\psi_n}{\partial t} = \sum_n c_n E_n\psi_n$,   (16)

so that (14) satisfies (2). Further, (14) reduces to $\psi^0$ for $t = 0$,
and hence it constitutes the solution of our problem. The case
when $\psi^0$ does not admit of the expansion (13) is of no physical
interest.
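The expansion procedure just described can be sketched numerically as follows. This is an illustration under stated assumptions, not part of the original text: units are chosen so that $\hbar = m = k = 1$ (hence $a = 1$ and $E_n = n + \tfrac12$), all phases $\gamma_n$ are set to zero, the series is truncated at $n = 20$, the integrals are replaced by sums on a finite grid, and the initial function (a displaced Gaussian) is an arbitrary choice. All names are hypothetical.

```python
import cmath
import math

N_MAX = 20                       # truncation of the series (an assumption)
DX = 0.01
GRID = [-10.0 + DX * i for i in range(2001)]
X0 = 1.0                         # displacement of the initial Gaussian

def hermite_functions(x, n_max):
    """Normalized oscillator eigenfunctions psi_0^0 .. psi_n_max^0 at x (a = 1).

    Uses the stable recurrence
    phi_{n+1} = sqrt(2/(n+1)) x phi_n - sqrt(n/(n+1)) phi_{n-1}.
    """
    phi = [math.pi ** -0.25 * math.exp(-0.5 * x * x)]
    if n_max >= 1:
        phi.append(math.sqrt(2.0) * x * phi[0])
    for n in range(1, n_max):
        phi.append(math.sqrt(2.0 / (n + 1)) * x * phi[n]
                   - math.sqrt(n / (n + 1)) * phi[n - 1])
    return phi

def psi_initial(x):
    """The given function psi^0: a normalized Gaussian centered at X0."""
    return math.pi ** -0.25 * math.exp(-0.5 * (x - X0) ** 2)

BASIS = [hermite_functions(x, N_MAX) for x in GRID]   # BASIS[i][n]

# Expansion coefficients c_n = integral of conj(psi_n^0) psi^0 dx, as in (13);
# the basis functions are real here, so no conjugation is needed.
coeffs = [sum(BASIS[i][n] * psi_initial(x) for i, x in enumerate(GRID)) * DX
          for n in range(N_MAX + 1)]

def psi_at(i, t):
    """psi(x_i, t) = sum_n c_n psi_n^0(x_i) exp(-i E_n t), equation (14)."""
    return sum(c * BASIS[i][n] * cmath.exp(-1j * (n + 0.5) * t)
               for n, c in enumerate(coeffs))

def norm(t):
    """Integral of conj(psi) psi over the grid at time t."""
    return sum(abs(psi_at(i, t)) ** 2 for i in range(len(GRID))) * DX

print(sum(c * c for c in coeffs))   # close to 1: the expansion is complete
print(norm(0.0), norm(1.3))         # both close to 1
```

The last line illustrates the result of Exercise 4 below: the normalization integral does not change with time even though the function (14) is not an eigenfunction of $H$.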

Exercises 

1. How does the choice of the phases of the functions (7) affect the
$c$'s in (14)? How does this choice affect the function (14)?

2. Outline the computation of $\psi(x, t)$ for an oscillator when $\psi(x, t_0)$
is known, and $t_0 \neq 0$.

3. Use 46* to show that if 7 is a positive integer and \j/n is (7) then 

J rPndx^himf j '^n^^ndx. (17) 

4. Show that if $\psi$ is a normalizable solution of (1) then

$\frac{d}{dt}\int \bar\psi\psi\,dx = 0$,   (18)

even if $\psi$ is not an eigenfunction of $H$. Hint: first differentiate with respect
to $t$ under the integral sign.





17. Specification of States 

Calculations of the results of certain experiments on a dynamical 
system must be based on information specifying both the kind 
and the state of the system under consideration. The state of a 
classical harmonic oscillator, for example, may be specified by 
designating the precise initial conditions of the motion, as in §10; 
by stating that its energy is E and that nothing else is known, 
as in Exercise 8^^; by designating the distributions in x and in p 
at some instant , as in Exercise 1^^; or in some other manner. 
Such descriptions of the state are readily translated into mathe- 
matical terms and enable us to proceed to calculations concerning 
outcomes of experiments. 

In quantum theory it is of course also necessary to have in- 
formation regarding the kind of dynamical system, and we already 
know how this information is translated into mathematical terms: 
the kind of system is taken care of by the operator $V$ in the
operator H associated with the Hamiltonian. Again, in certain 
quantum-mechanical problems it is also necessary to have physical 
information concerning the state of the system, and to summarize 
this information mathematically before calculations of the physical 
properties of the system in this state can be begun. It is our 
next task to consider how a verbal specification of the state of a 
system is to be translated into the mathematical terms of the 
Schroedinger method, and how, this translation having been 
made, calculations of results of experiments are to be carried out. 
In this connection we introduce the following general assumption, 
which we shall call the operand rule: 

IV. States of dynamical systems are to be associated with the 
operands of the quantum-mechanical operators, and vice versa. 

This assertion, taken by itself, is not very informative, since it
does not provide a method for associating operands with states;
it does suggest, however, that, in the Schroedinger scheme, states
are associated with and specified by well-behaved functions satisfying
$1^{16}$, and that, when verbal information concerning the state
of a system is provided, the computer's first task is to determine
just which $\psi$-function summarizes this information mathematically.

The operand rule is elaborated in the Schroedinger method by
the following assumption, about which most computations center;
in stating it, we use the symbol $\mathrm{av}_\psi\,\alpha$ (read, for short, 'the average
of $\alpha$ for the state $\psi$') to denote the expected average of the
dynamical variable $\alpha$ for the state specified by the Schroedinger
function $\psi$:

$\mathrm{V}^S$. When a dynamical system is in the state specified by the
Schroedinger function $\psi$, then the expected average, $\mathrm{av}_\psi\,\alpha$, of any
dynamical variable $\alpha$ is given by the formula

$\mathrm{av}_\psi\,\alpha = \frac{\int \bar\psi\,\alpha\,\psi\,d\tau}{\int \bar\psi\psi\,d\tau}$,   (1)

where $\alpha$ in the integrand is the Schroedinger operator associated
with the dynamical variable $\alpha$, and where the integrals extend
over the entire configuration space.

We shall call (1) the Schroedinger expectation formula. Until
Chapter XII we shall be concerned with one-dimensional motion,
and consequently there is for the present no need to define the
term configuration space in detail. Suffice it to say that in the
case of a single particle confined to the $x$-axis the term 'entire
configuration space' means the totality of points comprising the
$x$-axis. In the one-dimensional case our latest assumption reduces,
in fact, to the following form:

$\mathrm{V}_1^S$. When a one-dimensional system is in the state specified by
the Schroedinger function $\psi$, then the expected average, $\mathrm{av}_\psi\,\alpha$, of
any dynamical variable $\alpha$ is given by the formula

$\mathrm{av}_\psi\,\alpha = \frac{\int \bar\psi\,\alpha\,\psi\,dx}{\int \bar\psi\psi\,dx}$,   (2)

where $\alpha$ in the integrand is the Schroedinger operator associated
with the dynamical variable $\alpha$.


The expectation formula (2) implies, for example, that, if we
are told that the state of a one-dimensional system is specified by
a given Schroedinger function $\psi$, then we can compute the average
value of the momentum $p$ of the particle for this state by evaluating
the right side of the equation

$\mathrm{av}_\psi\,p = \frac{\int \bar\psi\left(-i\hbar\frac{\partial}{\partial x}\right)\psi\,dx}{\int \bar\psi\psi\,dx}$.   (3)

Note that, since $\psi$ in (2) is a function of $t$, $\mathrm{av}_\psi\,\alpha$ is in general also
a function of $t$, even though the dynamical variable $\alpha$ may have
the form $\alpha = \alpha(x, p)$ and not depend on $t$ explicitly.

The implications of the expectation formula (2) are far-reaching.
Thus, having been given the Schroedinger function specifying the
state of a one-dimensional system, we can compute the expected
average of any dynamical variable for this state for any time;
again, by calculating the respective expected averages of all
integral powers of $\alpha$, we are, in view of the uniqueness theorem of
§11, enabled, at least in principle, to infer the distribution-in-$\alpha$
for this state; in particular we can compute the inexactitude, to be
denoted by $\Delta_\psi\alpha$, of $\alpha$ for this state by means of the formula

(20^^)   $(\Delta_\psi\alpha)^2 = \mathrm{av}_\psi\,\alpha^2 - (\mathrm{av}_\psi\,\alpha)^2$,   (4)

and test the possibility that the result of a precise measurement of
$\alpha$ be a certainty.
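A small numerical illustration of formulas (3) and (4) may be helpful. The sketch below is not from the original text and rests on several assumptions: units with $\hbar = 1$, a Gaussian wave packet with a plane-wave factor $e^{ik_0x}$ as the given $\psi$ at one instant (the parameters `SIGMA` and `K0` are arbitrary), derivatives taken by central differences, and integrals taken by the trapezoidal rule on a finite interval. For this particular $\psi$ the average momentum works out to $\hbar k_0$ and the inexactitude of $p$ to $\hbar/(\sigma\sqrt2)$, which the code reproduces approximately.

```python
import cmath
import math

HBAR = 1.0            # illustrative units (an assumption)
SIGMA, K0 = 0.8, 1.5  # arbitrary packet width and wave number

def make_psi(c=1.0):
    """Return psi(x) = c * exp(-x^2 / 2 sigma^2) * exp(i k0 x), not normalized."""
    return lambda x: c * math.exp(-x * x / (2 * SIGMA ** 2)) * cmath.exp(1j * K0 * x)

def p_op(f, h=1e-4):
    """Return (-i*hbar d/dx) f, with d/dx as a central difference."""
    return lambda x: -1j * HBAR * (f(x + h) - f(x - h)) / (2 * h)

def integrate(g, lo=-10.0, hi=10.0, n=4000):
    """Trapezoidal rule on [lo, hi]."""
    step = (hi - lo) / n
    total = 0.5 * (g(lo) + g(hi))
    for i in range(1, n):
        total += g(lo + i * step)
    return total * step

def average(op, f):
    """Expectation formula (2): int conj(f) (op f) dx / int conj(f) f dx."""
    op_f = op(f)
    num = integrate(lambda x: f(x).conjugate() * op_f(x))
    den = integrate(lambda x: f(x).conjugate() * f(x))
    return num / den

psi = make_psi()
av_p = average(p_op, psi)                          # formula (3)
av_p2 = average(lambda f: p_op(p_op(f)), psi)      # operator for p^2
delta_p = math.sqrt(av_p2.real - av_p.real ** 2)   # formula (4)

# The same state multiplied by a constant gives the same average.
av_p_scaled = average(p_op, make_psi(3.0 - 2.0j))

print(av_p.real, delta_p, av_p_scaled.real)
```

No normalization of $\psi$ is required, since the denominator of (2) takes care of it.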

Conversely, having been given a verbal specification of a state,
we can determine the Schroedinger function that will specify the
state mathematically, by looking for that $\psi$ which, when used
in (2), yields data in agreement with those given us; this done,
we are ready to make computations concerning further properties
of the system in this state, computations in which, as we shall
see, Equation (2) again plays a central part.

The following three general implications of (2) are very important:

1. Since operators associated with dynamical variables are
linear, a substitution of $c\psi$ for $\psi$ in (2) does not affect the right
side of (2) if $c$ is a numerical constant. The results of
quantum-mechanical calculations for a state specified by a Schroedinger
function $\psi$ are thus exactly the same as those for a state specified
by the Schroedinger function $c\psi$, and we conclude that, if $c$ is a
constant, then the Schroedinger functions $\psi$ and $c\psi$ specify the same
state. In particular, the $\psi$-function obtained by normalizing a
given $\psi$-function specifies the same state as does the original
$\psi$-function; hence, in dealing with a normalizable $\psi$-function, we
may, without loss of generality and usually with a gain in simplicity,
replace it by the corresponding normalized $\psi$-function.

2. If the dynamical variable $\alpha$ is a function only of the coordinate
$x$, that is, if $\alpha = f(x)$, then it follows from (2) and $8^{14}$ that

$\mathrm{av}_\psi\,f(x) = \frac{\int \bar\psi\,f(x)\,\psi\,dx}{\int \bar\psi\psi\,dx}$.   (5)

If the operator $\alpha$ involves differentiation, it is of course not
permissible to rearrange the factors in the integrand in the numerator
of (2); but (5) can obviously be rewritten as

$\mathrm{av}_\psi\,f(x) = \frac{\int f(x)\,\bar\psi\psi\,dx}{\int \bar\psi\psi\,dx}$.   (6)

Equation (6), valid for any $f(x)$, holds in particular when $f(x) = x^s$,
$s = 1, 2, 3, \ldots$. Therefore, a comparison of (6) with 27“
in the light of the uniqueness theorem of §11 yields the result:
when a one-dimensional system is in the state specified by the
Schroedinger function $\psi$, then the distribution-in-$x$ of the particle
is $\bar\psi\psi$. For this reason, one-dimensional Schroedinger functions
are called amplitudes of distributions-in-$x$, or simply probability
amplitudes. If $\psi$ is normalized to unity, then $\bar\psi\psi$ is the absolute
distribution-in-$x$ for the state specified by the Schroedinger function
$\psi$. In the sequel we shall, for brevity, usually say 'the
state $\psi$' instead of 'the state specified by the Schroedinger
function $\psi$.'

3. If the state is specified by a $\psi$-function that is an eigenfunction
of the operator $\alpha$ associated with the dynamical variable
$\alpha$, then

$\alpha\psi = a\psi$,   (7)

where $a$ is a numerical constant, so that (2) yields the result

$\mathrm{av}_\psi\,\alpha = \frac{\int \bar\psi\,\alpha\,\psi\,dx}{\int \bar\psi\psi\,dx} = \frac{a\int \bar\psi\psi\,dx}{\int \bar\psi\psi\,dx} = a$,   (8)

and also the result

$\mathrm{av}_\psi\,\alpha^2 = \frac{\int \bar\psi\,\alpha^2\,\psi\,dx}{\int \bar\psi\psi\,dx} = \frac{a^2\int \bar\psi\psi\,dx}{\int \bar\psi\psi\,dx} = a^2$,   (9)

so that according to (4)

$\Delta_\psi\alpha = 0$.   (10)

Now, as pointed out in §11, the inexactitude of a quantity cannot
vanish unless only one value of this quantity is possible, and we
conclude: if $\psi$ is an eigenfunction of the operator $\alpha$ belonging to its
eigenvalue $a$, then the result of a precise measurement of the dynamical
variable $\alpha$ when the system is in the state $\psi$ is certainly $a$.
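The third implication can be illustrated for the oscillator Hamiltonian. The sketch below is an illustration only, under these assumptions: units with $\hbar = m = k = 1$ (so that $H = -\tfrac12\,\partial^2/\partial x^2 + \tfrac12 x^2$ and the lowest eigenvalue is $\tfrac12$), real trial functions (so that complex conjugation can be omitted), second derivatives by finite differences, and trapezoidal integration on a finite interval. It evaluates $\mathrm{av}_\psi H$ and $\mathrm{av}_\psi H^2 - (\mathrm{av}_\psi H)^2$ for the Gaussian eigenfunction and, for contrast, for the same Gaussian displaced by one unit, which is not an eigenfunction of $H$.

```python
import math

def hamiltonian(f, h=1e-2):
    """Return H f with H = -1/2 d^2/dx^2 + 1/2 x^2 (units hbar = m = k = 1)."""
    return lambda x: (-0.5 * (f(x + h) - 2.0 * f(x) + f(x - h)) / h ** 2
                      + 0.5 * x * x * f(x))

def integrate(g, lo=-9.0, hi=9.0, n=1800):
    """Trapezoidal rule on [lo, hi]."""
    step = (hi - lo) / n
    total = 0.5 * (g(lo) + g(hi))
    for i in range(1, n):
        total += g(lo + i * step)
    return total * step

def average(op, f):
    """Expectation formula (2) for a real function f."""
    op_f = op(f)
    return integrate(lambda x: f(x) * op_f(x)) / integrate(lambda x: f(x) * f(x))

def energy_statistics(f):
    """Return (av H, av H^2 - (av H)^2) for the state f."""
    av_h = average(hamiltonian, f)
    av_h2 = average(lambda g: hamiltonian(hamiltonian(g)), f)
    return av_h, av_h2 - av_h ** 2

def eigen_state(x):
    """Gaussian eigenfunction of H belonging to the eigenvalue 1/2."""
    return math.exp(-0.5 * x * x)

def displaced_state(x):
    """Same Gaussian shifted by one unit; not an eigenfunction of H."""
    return math.exp(-0.5 * (x - 1.0) ** 2)

print(energy_statistics(eigen_state))      # average near 0.5, spread near 0
print(energy_statistics(displaced_state))  # average near 1.0, spread near 0.5
```

Because of discretization error the computed spread for the eigenfunction is only approximately zero, and it may come out slightly negative; for that reason the sketch reports the squared inexactitude rather than its square root.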

Equation (2) enables us to compute the average of any dynamical
variable $\alpha$ for any state for which the Schroedinger
function, that is, the amplitude of the distribution-in-$x$, is known.
A calculation of $\mathrm{av}_\psi\,\alpha$ by means of (2) involves no preliminary
inquiries concerning the distribution-in-$\alpha$ for the state in question,
even if $\alpha$ is a function of both $x$ and $p$, or of $p$ alone; all we need
to do is to use the appropriate operator in the integrand of (2).
This procedure has no analogue in classical mechanics or statistics,
where distribution functions, rather than their amplitudes,
are of primary significance.


18. Energy States of an Oscillator 


In Exercises 8^^ and 9^^ we investigated on the classical basis 
some of the properties of an oscillator on the assumption that 
the energy of the oscillator is known to be certainly E and that 
no other information is available. We are now prepared to con- 
sider the corresponding problems from the quantum-mechanical 
standpoint. The $\psi$-functions with which we shall have to deal
are of integrable square (Exercise 14); therefore, in view of
theorem 1 of §17, they may without loss of generality be assumed
to be normalized. If $\psi$ is normalized, that is, if

$\int_{-\infty}^{\infty} \bar\psi\psi\,dx = 1$,   (1)

the Schroedinger expectation formula $2^{17}$ becomes

$\mathrm{av}_\psi\,\alpha = \int_{-\infty}^{\infty} \bar\psi\,\alpha\,\psi\,dx$.   (2)

Thus, for example, the expected average of the energy of the
oscillator for a state specified by a normalized Schroedinger
function $\psi$ is

$\mathrm{av}_\psi\,E = \int \bar\psi\,H\psi\,dx$,   (3)

where

$H = -\frac{1}{\kappa}\frac{\partial^2}{\partial x^2} + \tfrac12 kx^2$.   (4)

Similarly, the expected average of the square of the energy for
this state is

$\mathrm{av}_\psi\,E^2 = \int \bar\psi\,H^2\psi\,dx$.   (5)

The basis on which we regard equations such as (3) and (5) as
correct is, of course, the assumption $\mathrm{V}^S$ of §17.

The Schroedinger function for the normal state. The state- 
ment that the energy of the oscillator is certainly E means that 
the result of a precise experimental measurement of the energy is 
certain to be E. According to quantum mechanics, the only 
possible results of a precise measurement of the energy of an 
oscillator are 

($15^{15}$)   $E_n = (n + \tfrac12)h\nu_c$,   $n = 0, 1, 2, \ldots$.   (6)

Hence, saying that the energy of the oscillator is certainly E 
implies, in quantum theory, that E is one of the numbers listed 
in (6). 

For the present, we shall restrict ourselves to an oscillator 
whose energy is certainly 

$E_0 = \tfrac12 h\nu_c = \tfrac12\hbar\sqrt{k/m}$,   (7)

the lowest value listed in (6); an oscillator whose energy is certainly
$E_0$ is said to be in its normal state. Our first concern is to
identify the Schroedinger function describing the normal state;
that is, we must deduce the Schroedinger function from the
information that the energy of the oscillator is certainly $E_0$.

Now, the energy for a state is certainly $E_0$ if, and only if, the
average of the energy for this state is $E_0$ and the inexactitude of
the energy is zero; further, in view of 20“, the inexactitude of
the energy vanishes if, and only if, the square of the average of
the energy equals the average of the square of the energy.
Consequently, if $\psi$ is the normalized Schroedinger function that we
are seeking, then the statement that the energy of the oscillator
for the state $\psi$ is certainly $E_0$ can be summarized by the two
equations

$\int \bar\psi\,H\psi\,dx = E_0$,   $\int \bar\psi\,H^2\psi\,dx = E_0^2$,   (8)

where $H$ is the operator (4).

Consider now the quantity $\int \bar\psi\,(H - E_0)^2\psi\,dx$. If $\psi$ satisfies
Equations (8) and (1) we have

$\int \bar\psi\,(H - E_0)^2\psi\,dx = \int \bar\psi\,H^2\psi\,dx - 2E_0\int \bar\psi\,H\psi\,dx + E_0^2\int \bar\psi\psi\,dx = 0$;   (9)

again, the finiteness of the integrals (1) and (8) implies that our
$\psi$ satisfies the conditions of Exercise $3^{15}$ and that hence
$\int \bar\psi\,(H - E_0)^2\psi\,dx = \int |(H - E_0)\psi|^2\,dx$. We thus get the
equation

$\int |(H - E_0)\psi|^2\,dx = 0$.   (10)

The integrand in (10) is nowhere negative, and hence it must
vanish everywhere in order that (10) may hold. Consequently,
$(H - E_0)\psi = 0$; that is, the $\psi$-function that we have been seeking
is a well-behaved solution of the equation

$H\psi = E_0\psi$.   (11)

Equation (11) is just the Schroedinger equation $3^{15}$ for the
oscillator, with $E$ set equal to $E_0$, and its normalized solution,
denoted by $\psi_0$, is given in Exercise $2^{15}$; this solution, adjusted to
be properly time-dependent, is given in $7^{16}$:

$\psi_0 = e^{i\gamma}\,\frac{1}{\sqrt{a\sqrt{\pi}}}\,e^{-\frac12 (x/a)^2}\,e^{-\frac12 i\omega t}$,   (12)

where

$\omega = 2\pi\nu_c = \sqrt{k/m}$,   (13)

and where $\gamma$ is an arbitrary real constant independent of both
$x$ and $t$.

Our first task has been accomplished: we now know that the
normalized Schroedinger function, specifying the state of the
oscillator for which the energy is certainly $E_0$, is (12).

By the theorem 3 of §17, the condition that $\psi$ be an eigenfunction
of $H$ belonging to the eigenvalue $E$ is sufficient in order
that the result of a precise energy measurement for the state $\psi$
be certainly $E$; the present argument illustrates that this condition
is also necessary.

The distribution-in-x for the normal state. In §17 we concluded
that when a one-dimensional system is in the state specified
by the Schroedinger function $\psi$, then the distribution-in-$x$ of the
particle is $\bar\psi\psi$. The distribution-in-$x$ for an oscillator whose
energy is certainly $E_0$ [we denote this distribution by $P_0(x)$] is
consequently

$P_0(x) = \bar\psi_0\psi_0 = \frac{1}{a\sqrt{\pi}}\,e^{-x^2/a^2}$.   (14)

This absolute distribution, unlike its amplitude (12), is independent
of the time and is free from any arbitrariness. A
graph of (14) is shown in Fig. 23, where $V$ is the potential $\tfrac12 kx^2$ and
the horizontal line, labeled $E_0$ and drawn to the same scale as $V$,
represents the total energy of the oscillator. Fig. 23 also shows by a
dotted line the distribution-in-$x$ of a classical oscillator having the
same mass, restoring constant, and total energy as those of the
quantum-mechanical oscillator whose distribution function is (14); the
classical distribution is obtained from 25^^ and is drawn to the same
scale as $\bar\psi_0\psi_0$. The two distributions are seen to be very different
from each other.

Perhaps the most striking property of the quantum-mechanical
distribution in Fig. 23 is that the distribution-in-$x$ does not
vanish outside the classical limits of $x$ for an oscillator of energy
$E_0$, that is, outside the limits $-\sqrt{2E_0/k} \le x \le \sqrt{2E_0/k}$: according
to the quantum theory, a particle may be found in regions that are
not accessible to it on the classical basis. This feature of the theory
provides an interpretation of certain physical phenomena that
cannot be accounted for classically.
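The size of this effect for the normal state is easy to estimate. Since $E_0 = \tfrac12\hbar\sqrt{k/m}$ and $a^2 = \hbar/\sqrt{mk}$, the classical limits $\pm\sqrt{2E_0/k}$ coincide with $x = \pm a$. The sketch below, which is an illustration added here and not a computation from the text, integrates the distribution (14) between these limits by the trapezoidal rule, with $a$ set to 1 in arbitrary units, and compares the remainder with the complementary error function from the Python standard library.

```python
import math

A = 1.0  # the length a of the oscillator, in arbitrary units (assumed value)

def p0(x):
    """Absolute distribution-in-x for the normal state, equation (14)."""
    return math.exp(-(x / A) ** 2) / (A * math.sqrt(math.pi))

def integrate(g, lo, hi, n=20000):
    """Trapezoidal rule on [lo, hi]."""
    step = (hi - lo) / n
    total = 0.5 * (g(lo) + g(hi))
    for i in range(1, n):
        total += g(lo + i * step)
    return total * step

inside = integrate(p0, -A, A)   # probability of finding the particle within the classical limits
outside = 1.0 - inside          # probability of finding it outside them
print(outside, math.erfc(1.0))  # the two numbers agree closely
```

The result, roughly 16 percent, does not depend on the value chosen for $a$, because the classical limits scale with $a$ in the same way as the distribution does.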



Fig. 23. The distribution-in-$x$ of a linear harmonic oscillator
in the normal state (thick solid curve).



Since a direct experimental measurement of $p$ will obviously
never yield an imaginary value for $p$, the quantum-mechanical
possibility of finding the particle outside the classical limits implies
that the classical relation connecting the numerical values
of $x$, $p$, and $E$ of the oscillator at any instant, that is,

$E = \frac{1}{2m}p^2 + \tfrac12 kx^2$,   (15)

does not hold in quantum mechanics. The situation is somewhat
as follows: According to the classical view, we can determine the
energy of an oscillator either by some direct measurement, or by
measuring the precise simultaneous values of $x$ and $p$ at some
instant and substituting the results into the right side of (15).
But, according to quantum mechanics, although a precise measurement
of $x$ or of $p$ at any instant is possible, these measurements
cannot be carried out simultaneously (§§23 and 24); hence, no
equations relating simultaneous values of $x$ and $p$, or simultaneous
values of $x$, $p$, and $E$, appear in the theory. The fact that (15)
is not to be found among quantum-mechanical equations signifies,
in essence, that, if the energy of an oscillator is to be determined,
it must be measured directly, and that it cannot be deduced
indirectly from the results of measurements of $x$ and $p$.

If a precise experiment, designed to determine the position of the particle, be performed at the instant $t_0$ on an oscillator in the normal state, a definite value of $x$ will be obtained, lying, according to (14), somewhere in the infinite region from $-\infty$ to $\infty$; but such a measurement will violently perturb the oscillator, so that, for example, were a precise measurement of $p$ to follow immediately the measurement of $x$, the result of the second measurement could not be expected to be the same as it would have been had the measurement of $x$ not been made.

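The absolute distribution-in-$x$ being available, we can calculate by the standard statistical method the average of any function of $x$ for the normal state. Using the symbol $\mathrm{av}_0$ to denote an expected average for the state $\psi_0$, we have, for example,

$$\mathrm{av}_0\,x^s = \int_{-\infty}^{\infty} x^s P_0(x)\,dx = \frac{1}{a\sqrt{\pi}}\int_{-\infty}^{\infty} x^s e^{-x^2/a^2}\,dx. \tag{16}$$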





THE SCHROEDINGER METHOD 


§18 


If $s$ is an odd integer, (16) vanishes, since the integrand is odd in $x$ and the interval of integration is symmetric about $x = 0$. If $s = 2j$, where $j$ is a positive integer, we get, using 24b^^,

$$\mathrm{av}_0\,x^{2j} = 1\cdot 3\cdot 5\cdots(2j-1)\,2^{-j}a^{2j}. \tag{17}$$

In particular, $\mathrm{av}_0\,x^2 = \tfrac{1}{2}a^2$, that is,

$$\mathrm{av}_0\,x^2 = E_0/k. \tag{18}$$

Comparison of (18) with the corresponding result of Exercise 8^^ shows that, in spite of the dissimilarity between the classical and the quantum-mechanical distributions-in-$x$ of Fig. 23, the expected average of $x^2$ is the same for both. The inexactitude of $x$ for the normal state is

$$\Delta_0 x = \sqrt{\mathrm{av}_0\,x^2 - \mathrm{av}_0^2\,x} = \sqrt{E_0/k}. \tag{19}$$
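The moment formula (17) can be checked by direct numerical integration of (16). The following is a minimal sketch, assuming the Gaussian form of $P_0(x)$ in (14); the helper names and the choice of integration grid are illustrative and not part of the text.

```python
import math

def odd_product(j):
    """Return 1*3*5*...*(2j-1); equals 1 for j = 0."""
    result = 1
    for m in range(1, 2 * j, 2):
        result *= m
    return result

def moment_formula(j, a=1.0):
    """Right side of (17): 1*3*5...(2j-1) * 2^(-j) * a^(2j)."""
    return odd_product(j) * 2.0 ** (-j) * a ** (2 * j)

def moment_numeric(s, a=1.0, half_width=12.0, n_steps=240_000):
    """Midpoint estimate of the integral of x^s * P0(x) over (-L, L), L = half_width*a."""
    L = half_width * a
    h = 2.0 * L / n_steps
    total = 0.0
    for i in range(n_steps):
        x = -L + (i + 0.5) * h
        total += x ** s * math.exp(-(x / a) ** 2)
    return total * h / (a * math.sqrt(math.pi))

for j in range(1, 4):
    print(2 * j, moment_numeric(2 * j), moment_formula(j))
print(3, moment_numeric(3))  # odd power: vanishes by symmetry
```

The even moments agree with (17) to within the integration error, and the odd moments vanish to rounding accuracy.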

The distribution function of any dynamical variable that de- 
pends explicitly only on x can be calculated, for the normal state 
of the oscillator, by means of (14) and 24^\ 

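The distribution-in-p for the normal state. Let us consider next the momentum of a linear harmonic oscillator in the normal state. According to (2) we have, for example,

$$\mathrm{av}_0\,p^s = \int \bar\psi_0\left(-i\hbar\frac{\partial}{\partial x}\right)^{s}\psi_0\,dx, \tag{20}$$

that is,

$$\mathrm{av}_0\,p^s = \frac{1}{a\sqrt{\pi}}\int e^{-x^2/2a^2}\left(-i\hbar\frac{\partial}{\partial x}\right)^{s} e^{-x^2/2a^2}\,dx. \tag{21}$$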


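Since $\psi_0$ is even in $x$, $\mathrm{av}_0\,p^s$ vanishes whenever $s$ is an odd positive integer, because the integrand is then odd in $x$. When $s = 2j$, where $j$ is a positive integer, we have, in view of (20) and 17^^,

$$\mathrm{av}_0\,p^{2j} = \int \bar\psi_0\left(-i\hbar\frac{\partial}{\partial x}\right)^{2j}\psi_0\,dx = m^j k^j\int \bar\psi_0\,x^{2j}\,\psi_0\,dx, \tag{22}$$

that is,

$$\mathrm{av}_0\,p^{2j} = m^j k^j\,\mathrm{av}_0\,x^{2j}, \tag{23}$$

so that, according to (17),

$$\mathrm{av}_0\,p^{2j} = 1\cdot 3\cdot 5\cdots(2j-1)\,2^{-j}b^{2j}, \tag{24}$$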




where

$$b = a\sqrt{km} = \hbar/a. \tag{25}$$

In particular, $\mathrm{av}_0\,p^2 = \tfrac{1}{2}b^2$, that is,

$$\mathrm{av}_0\,p^2 = mE_0, \tag{26}$$

a result in agreement with the corresponding classical result of 
Exercise The inexactitude of p for the normal state is 

$$\Delta_0 p = \sqrt{\mathrm{av}_0\,p^2 - \mathrm{av}_0^2\,p} = \sqrt{mE_0}. \tag{27}$$

The product of the respective inexactitudes of $x$ and $p$ for the normal state is of interest. From (19) and (27) we get

$$\Delta_0 x\,\Delta_0 p = E_0\sqrt{m/k}, \tag{28}$$

in agreement with the corresponding classical result of Exercise 9^^; but, eliminating $E_0$ by means of (7), we find that

$$\Delta_0 x\,\Delta_0 p = \frac{h}{4\pi}, \tag{29}$$

so that $\Delta_0 x\,\Delta_0 p$ of a linear harmonic oscillator is a universal constant independent of $m$ and $k$.
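A short numerical illustration of this independence follows. It is only a sketch: it assumes $E_0 = \tfrac{1}{2}h\nu_c$ with $\nu_c = \sqrt{k/m}/2\pi$, uses the SI value of Planck's constant, and the sample masses and restoring constants are arbitrary choices made for the example.

```python
import math

H_PLANCK = 6.62607015e-34            # Planck's constant h, in J*s
HBAR = H_PLANCK / (2.0 * math.pi)

def inexactitudes(m, k):
    """Return (delta_x, delta_p) for the normal state of mass m and constant k."""
    omega = math.sqrt(k / m)         # assumed: 2*pi*nu_c = sqrt(k/m)
    e0 = 0.5 * HBAR * omega          # E0 = (1/2) h nu_c
    delta_x = math.sqrt(e0 / k)      # Eq. (19)
    delta_p = math.sqrt(m * e0)      # Eq. (27)
    return delta_x, delta_p

# Arbitrary sample systems: (mass in kg, restoring constant in N/m).
for m, k in [(9.109e-31, 1.0), (1.673e-27, 500.0), (1.0e-3, 10.0)]:
    dx, dp = inexactitudes(m, k)
    print(m, k, dx * dp / (H_PLANCK / (4.0 * math.pi)))
```

For each pair $(m, k)$ the printed ratio is unity to rounding accuracy, although $\Delta_0 x$ and $\Delta_0 p$ separately differ by many orders of magnitude from one system to the next.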

We now proceed to find the distribution-in-$p$, $P_0(p)$, for the normal state of the oscillator. Were $P_0(p)$ available, we could calculate $\mathrm{av}_0\,p^s$ by the standard statistical method:

$$\mathrm{av}_0\,p^s = \int p^s P_0(p)\,dp. \tag{30}$$

Now, we do not know what $P_0(p)$ is, but we do know from (21) the values of (30) for all positive integral values of $s$:

$$\int p^s P_0(p)\,dp = \begin{cases} 1\cdot 3\cdot 5\cdots(2j-1)\,2^{-j}b^{2j} & \text{if } s = 2j, \\ 0 & \text{if } s = 2j+1. \end{cases} \tag{31}$$

In view of the resemblance of these formulas to those which we proved a moment ago for the coordinate $x$, that is

$$\int x^s P_0(x)\,dx = \begin{cases} 1\cdot 3\cdot 5\cdots(2j-1)\,2^{-j}a^{2j} & \text{if } s = 2j, \\ 0 & \text{if } s = 2j+1, \end{cases} \tag{32}$$




$P_0(p)$ is found rather simply. Indeed, we note that, if we use for $P_0(p)$ in (31) the function obtained from $P_0(x)$ by writing $p$ for $x$ and $b$ for $a$, then (31) will follow from (32); and [since, in view of the uniqueness theorem of §11, $P_0(p)$ is presumably determined by Equations (31) uniquely⁷] we conclude that the distribution-in-$p$ for an oscillator in the normal state is

$$P_0(p) = \frac{1}{b\sqrt{\pi}}\,e^{-p^2/b^2}. \tag{33}$$

The distribution (33), superimposed on the classical distribution-in-$p$ of an oscillator of energy $E_0$, is shown in Fig. 24. The two distributions are quite different from each other; in particular, quantum mechanically, a precise measurement of the momentum of an oscillator of energy $E_0$ may yield any value between $-\infty$ and $\infty$, while classically only values between $-\sqrt{2mE_0}$ and $\sqrt{2mE_0}$ are possible.


The method used above for computing $P_0(p)$ is effective only because of the symmetry of the Hamiltonian of the oscillator in $x$ and $p$; for other systems a comparison of the averages of powers of $p$ with those of powers of $x$ is not very helpful, and in §27 we shall develop a general method by which the distribution-in-$p$ can be calculated directly from the Schroedinger function for any one-dimensional system.

Before (33) was established, we had only one way to calculate the expected average of a function of $p$ for the normal state, that is, by using (2):

$$\mathrm{av}_0\,f(p) = \int \bar\psi_0\,f\!\left(-i\hbar\frac{\partial}{\partial x}\right)\psi_0\,dx; \tag{34}$$

but now we have an alternative method that will yield the same result:

$$\mathrm{av}_0\,f(p) = \frac{1}{b\sqrt{\pi}}\int f(p)\,e^{-p^2/b^2}\,dp. \tag{35}$$

Fig. 24. The distribution-in-$p$ of a linear harmonic oscillator in the normal state (thick solid curve).
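The equivalence of the two methods can be illustrated numerically for $f(p) = p^4$. The sketch below works in units with $\hbar = 1$ and $a = 1$ (so that $b = 1$), drops the time and phase factors of $\psi_0$ since they cancel, and replaces the fourth derivative by a central finite difference; the step size and function names are assumptions of the example.

```python
import math

def psi0(x):
    """Normal-state amplitude with a = 1 (time and phase factors dropped)."""
    return math.pi ** -0.25 * math.exp(-0.5 * x * x)

def avg_p4_from_operator(h=0.01, half_width=10.0):
    """Operator route with hbar = 1: integral of psi0 * d^4(psi0)/dx^4."""
    n = int(round(2.0 * half_width / h))
    total = 0.0
    for i in range(n + 1):
        x = -half_width + i * h
        # Central finite-difference estimate of the fourth derivative.
        d4 = (psi0(x - 2 * h) - 4 * psi0(x - h) + 6 * psi0(x)
              - 4 * psi0(x + h) + psi0(x + 2 * h)) / h ** 4
        total += psi0(x) * d4
    return total * h

def avg_p4_from_distribution(h=0.01, half_width=10.0):
    """Statistical route with b = 1: integral of p^4 * exp(-p^2) / sqrt(pi)."""
    n = int(round(2.0 * half_width / h))
    total = 0.0
    for i in range(n + 1):
        p = -half_width + i * h
        total += p ** 4 * math.exp(-p * p)
    return total * h / math.sqrt(math.pi)

print(avg_p4_from_operator(), avg_p4_from_distribution())
```

Both routes give a value close to $\tfrac{3}{4}$, which is what the moment formula for $p$ yields for $j = 2$ and $b = 1$; the operator route carries a small finite-difference error.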


⁷ For a proof of the validity of the uniqueness theorem in the present case, see Uspensky, page 383.





The distribution function of any dynamical variable that 
depends explicitly only on p can be computed, for the normal 
state of the oscillator, by means of (33) and 24“. 

If a dynamical variable depends on both $x$ and $p$, then its average and the averages of all of its powers can be computed for the normal state of the oscillator by means of (2) and (12); consequently, the distribution function of every dynamical variable that depends only on $x$ and $p$ can be found, at least in principle. The phase factors of $\psi_0$ and $\bar\psi_0$ cancel in (2), and therefore the average of any dynamical variable is free from any ambiguity. The time factors of $\psi_0$ and $\bar\psi_0$ also cancel in (2), so that the expected average of any dynamical variable not involving $t$ explicitly is independent of $t$, and hence the respective distribution functions of all such variables are independent of $t$; for this reason, the state $\psi_0$ is called a stationary state.

Summary. We began the study of the normal state of an oscillator by translating into mathematical terms suitable for further calculations by the Schroedinger method the statement that the energy of the oscillator is certainly $E_0$; that is, assuming the energy to be certainly $E_0$, we identified the particular Schroedinger function $\psi_0$ that would properly summarize this information. The amplitude of the distribution-in-$x$ being then known, we discussed the distribution-in-$x$ and computed the averages of the powers of $x$. We turned next to the momentum, calculated the averages of the powers of $p$ by means of our fundamental equation (2), and, finding these averages to be very similar to the averages of the powers of $x$, were able to infer the distribution-in-$p$ for the normal state. Our illustrations of the manipulation of the Schroedinger function in the course of computations of the physical properties of a state were chosen partly on account of their mathematical simplicity, but our examples are perhaps sufficient to show how the answering of any question relative to the outcome of any experiment, to be performed on an oscillator in the normal state and designed to measure the value of any dynamical variable not depending explicitly on $t$, can be reduced to a definite mathematical problem by the straightforward application of (2).

The questions that we treated are analogous to those con- 
sidered from the classical standpoint in Exercises 8“ and 9“. 
The mathematical methods that we employed in the classical 





theory and in the quantum theory of these analogous problems cannot very well be compared; they are different, and little more can be said. But the respective physical conclusions drawn on the basis of the two theories can be compared in detail; we made some comparisons and found illustrations of fundamental incompatibility, such as quantization of the energy, the possibility that a precise measurement of $x$ will yield a value lying outside the classical limits, and so forth, as well as illustrations of agreement, such as the fact that both the classical and the quantum-mechanical distributions-in-$x$ are stationary, that $\Delta x$ is the same for both distributions, and so on. We shall now contrast the physical background used in the classical treatment of these problems with that employed in the quantum-mechanical calculations.

In Exercises 8^^ and 9^^ the classical treatment of an oscillator was based on the premise that the only information available about the condition of the oscillator is that the energy is certainly $E$; that is, we assumed that

(a) the energy is certainly $E$,

(b) no further information is available.

It was quite essential to make the assumption (b) explicitly, because information relevant to the problem and consistent with (a) could be given in addition to (a) in such a way as to lead to results quite different from those that we derived from (a) and (b); for example, if, in addition to (a), we were given the values of $x$ and $p$ at some instant $t_0$, our conclusions would be expressed in terms of certainties rather than probabilities. In other words, (a) does not alone constitute a maximal set of conditions from the classical viewpoint.

The classical distribution-in-$x$, 25^V, was derived from (a) and (b) along lines such as these: it follows from (a) and from the classical laws of motion that the particle moves on a perfectly definite schedule between the points $-\sqrt{2E/k}$ and $\sqrt{2E/k}$, and has a calculable speed at every point of its path; it follows from (b) that we are not able to set up any connection between the schedule of the particle and the time as recorded by our laboratory clock; consequently, the probability that the particle will be found in $dx$ at $x$ at the time $t$ is proportional to the time spent by the particle in $dx$ at $x$ in the course of its motion and is independent of the instant $t$ at which the measurement is to be made; this




probability is therefore inversely proportional to the known speed with which the particle passes the point $x$. This reasoning yields the distribution-in-$x$, and similar reasoning can be used in finding the distribution functions of other dynamical variables.
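The classical reasoning just described can be mimicked numerically. The sketch below assumes the motion $x = A\cos(2\pi t/T)$ with amplitude $A = \sqrt{2E/k}$, samples it at equally spaced instants over one period (which plays the part of assumption (b)), and compares the fraction of instants falling in an interval with the integral of the inverse-speed density $1/\pi\sqrt{A^2 - x^2}$; this density is the standard classical result and is taken here as an assumption, since the text's own formula is only cited by number.

```python
import math

def fraction_of_time_in_interval(x1, x2, amplitude=1.0, n_samples=200_000):
    """Fraction of one period a classical oscillator spends with x1 < x < x2.

    The motion x = A*cos(phase) is sampled at equally spaced phases,
    mirroring condition (b): the schedule of the motion is unknown.
    """
    count = 0
    for i in range(n_samples):
        phase = 2.0 * math.pi * (i + 0.5) / n_samples
        x = amplitude * math.cos(phase)
        if x1 < x < x2:
            count += 1
    return count / n_samples

def fraction_from_inverse_speed(x1, x2, amplitude=1.0):
    """Integral of 1/(pi*sqrt(A^2 - x^2)) from x1 to x2 (inverse-speed rule)."""
    return (math.asin(x2 / amplitude) - math.asin(x1 / amplitude)) / math.pi

print(fraction_of_time_in_interval(0.2, 0.6))
print(fraction_from_inverse_speed(0.2, 0.6))
```

The two numbers agree to within the sampling error, and the result does not depend on which instant is taken as the start of the period.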

The quantum-mechanical treatment of the problem proceeded along quite different lines. We assumed that

(a') the energy is certainly $E_0$,

and showed how any physical question regarding the oscillator could be answered by straightforward mathematical means and without ambiguity on the basis of this information alone. No references to details of the motion were made; it was not implied that the particle moves on any particular schedule, or that its speed varies in any definite way; in short, no relation of the classical type $x = F(t)$ was presumed to exist between the position of the particle and the time. [Except in the case of heavier particles, when many of its results merge into those of classical theory, quantum mechanics does not yield results of the type $x = F(t)$, and hence does not permit us to form mental pictures of the manner in which a particle behaves between observations.]

The fact that no assumption similar to (b) was necessary before we could proceed with the calculations and carry them out to the end implies that no assumption relevant to the problem, compatible with (a'), and not included in (a') can be made in the quantum theory of the oscillator. This result holds also when $E_0$ is replaced by $E_n$, and means that, according to quantum mechanics, it is not possible experimentally to establish the fact that the energy of the oscillator is a certainty, and (still being sure of the energy) to get further information regarding the particle, information that cannot be deduced from the knowledge of the energy alone. For example, according to quantum mechanics, it is impossible to make sure experimentally that at $t = t_0$ the energy of the oscillator is certainly $E_0$ and that the particle is at that instant in a definite position. Thus, from the quantum-mechanical standpoint, the condition (a') is alone a maximal set of conditions for the oscillator.

Excited states. States of the oscillator for which its energy is respectively certainly $E_1 = \tfrac{3}{2}h\nu_c$, $E_2 = \tfrac{5}{2}h\nu_c$, and so forth, are called its first excited state, its second excited state, and so on. Any state for which the energy is a certainty is called an energy




state. The state of the oscillator for which the energy is certainly $E_n = (n + \tfrac{1}{2})h\nu_c$ is called the $n$th energy state or the $n$th quantum state. A state for which the value of a dynamical variable $\alpha$ is a certainty is often called an eigenstate of $\alpha$; energy states are thus eigenstates of the Hamiltonian.

The argument used at the beginning of this section to identify the $\psi$-function describing the normal state leads to the conclusion that the normalized $\psi$-function describing the state for which the energy of the oscillator is certainly $E_n$ is the function $\psi_n$ given by 7^®. Graphs⁸ of the right-hand halves of the distributions-in-$x$ for $n$ = 0, 1, 2, 3, and 4 are shown in Fig. 25; the first diagram in this figure is a part of Fig. 23. The corresponding classical distributions are also shown, and it is seen that, for the higher values of the quantum number $n$, the quantum-mechanical distribution is more like the corresponding classical one, and that, in particular, the probability of finding the particle outside the classical limits is smaller for higher energies. An interesting characteristic of the excited states is the nodes of the distribution-in-$x$, that is, values of $x$ at which the distribution-in-$x$ is zero. They signify impossible⁹ values of $x$, that is, places at which the particle can never be found by a precise position measurement, although it may be found on either side of a node. These nodes may serve to emphasize the fact that quantum mechanics concerns itself with results of observations, and not with mental pictures of the behavior of a particle between observations.

In the exercises below, we let $P_n(\alpha)$, $\mathrm{av}_n\,\alpha$, and $\Delta_n\alpha$ denote, respectively, the distribution-in-$\alpha$, the average of $\alpha$, and the inexactitude of $\alpha$ for the $n$th energy state of the oscillator.

⁸ Taken from E. U. Condon and P. M. Morse's Quantum Mechanics, page 51. New York: McGraw-Hill Book Co., 1929.

⁹ 'Impossible value' is a stronger restriction than 'value having zero probability,' since, in the case of a continuous distribution, any preassigned value has zero probability.

Fig. 25. Distributions-in-$x$ for energy states of a linear harmonic oscillator. The quantum-mechanical distributions are marked Q; the classical distributions for the same $m$, $k$, and $E$ are marked C.

Exercises 

1. Show that, if a linear harmonic oscillator is in its normal state, then the probability of finding the particle outside the classical limits is approximately 16 per cent.

2. Show that, if a linear harmonic oscillator is in its normal state, then the expected average of the dynamical variable $xp$ is zero.

3. Show that the condition that the state of an oscillator be specified by the Schroedinger function $\psi_n$ in 7^® is, apart from normalization, both necessary and sufficient in order that the energy of the oscillator be certainly $E_n$.

4. Show that

$$P_n(x) = \frac{1}{2^n n!\,a\sqrt{\pi}}\,H_n^2(x/a)\,e^{-x^2/a^2}, \tag{36}$$

and that the $n$ nodes of $P_n(x)$ lying at finite values of $x$ lie within the classical limits of $x$.

5. Show that $\mathrm{av}_n\,x^4 = 3(n^2 + n + \tfrac{1}{2})E_n^2/2(n + \tfrac{1}{2})^2k^2$, and note that, when $n \to \infty$, this result approaches the corresponding classical result. Hint: use 63^ twice over and remember the orthogonality of the $\psi$'s.

6. Show that if $j$ is a positive integer or zero, then $\mathrm{av}_n\,x^{2j+1} = \mathrm{av}_n\,p^{2j+1} = 0$, and $\mathrm{av}_n\,p^{2j} = m^j k^j\,\mathrm{av}_n\,x^{2j}$. Hint: note 17^®.

7. The function $\psi_n$ in 7“ satisfies the Schroedinger equation $-\frac{\hbar^2}{2m}\frac{\partial^2\psi_n}{\partial x^2} + \tfrac{1}{2}kx^2\psi_n = E_n\psi_n$. Multiply every term of this equation on the left by $\bar\psi_n$, integrate with respect to $x$ from $-\infty$ to $\infty$, and show that

$$\frac{1}{2m}\,\mathrm{av}_n\,p^2 + \tfrac{1}{2}k\,\mathrm{av}_n\,x^2 = E_n. \tag{37}$$

Equation (37) is, in a sense, the nearest approach possible in quantum mechanics to the classical equation (15).

8. Using (37) and the results of Exercise 6, show that $\mathrm{av}_n\,x^2 = E_n/k$, $\mathrm{av}_n\,p^2 = mE_n$, and $\Delta_n x\,\Delta_n p = (2n + 1)h/4\pi$; compare with the corresponding classical results of Exercises 8^^ and 9^^^.

9. Show that the respective expected averages of the kinetic and the potential energies of an oscillator in the $n$th energy state are $\tfrac{1}{2}E_n$ each; compare Exercise 11“.

10. Let $\epsilon$ be the dynamical variable $p/\sqrt{mk}$. Recall Exercise 6 and show that $\mathrm{av}_n\,\epsilon^s = \mathrm{av}_n\,x^s$ for every positive integral value of $s$. Infer $P_n(\epsilon)$, and show that

$$P_n(p) = \frac{1}{2^n n!\,b\sqrt{\pi}}\,H_n^2(p/b)\,e^{-p^2/b^2}, \tag{38}$$

where $b$ is (25), and that the $n$ nodes of $P_n(p)$ lying at finite values of $p$ lie within the classical limits of $p$.

11. Show that if both (36) and (38) be expressed in terms of $m$, $k$, and





$E_n$, rather than in terms of $a$ and $b$, then (36) can be transformed into (38) by exactly the same substitutions that carry 25^^ into 26“.

12. Equation (15) implies that $p$ can have a value outside the limits $-\sqrt{2mE}$ and $\sqrt{2mE}$ only if $x$ is imaginary. An apparatus designed to measure the position of a particle, for example, a microscope, will obviously never yield an imaginary result. How do these comments bear upon the feasibility of the quantum-mechanical claim (illustrated, for example, by Fig. 24) that the momentum of a linear harmonic oscillator may, upon a precise measurement, be found to lie outside the classical limits of $p$?

13. Use (33) to recompute the value of $\Delta_0 p$ given in (27).

14. Consider 2^"^ under the assumption that $\alpha$ is (4) and $\psi$ is not quadratically integrable, and show that in the study of the oscillator we may restrict ourselves to quadratically integrable $\psi$'s.

19. Composition of States 

The discussion of §18 was concerned with a linear harmonic oscillator whose energy is a certainty. We shall next consider cases in which the energy is not a certainty, and, in preparation for a more general treatment, shall take up from the quantum-mechanical viewpoint the problem studied classically in Exercise 13. The problem is to make calculations regarding the outcomes of various experiments to be performed on an oscillator prepared as follows: we instruct a person to toss a well-balanced coin and then to adjust the energy of an oscillator to be certainly $E_I$ if heads come up, and to be certainly $E_{II}$, $E_{II} \neq E_I$, if tails do; he, in our absence, does as instructed and then returns the oscillator to us without comment. We shall assume that, whatever the energy is once adjusted to be, the oscillator will retain it indefinitely (§31). Both $E_I$ and $E_{II}$ must, of course, now be quantum-mechanically permissible energy values of the oscillator.

The energy of our oscillator is either $E_I$ or $E_{II}$, but, on the basis of only the information given above, we cannot decide which. A suitable experiment would reveal what the energy actually is; we wish, however, to investigate the properties of the oscillator before getting any information besides that given above, and consequently we must allow for both alternatives.

Our first concern is to describe the state of the oscillator by an appropriate Schroedinger function; therefore, we enumerate the conditions which this function, to be denoted by $\psi$, must satisfy. In the first place, being a Schroedinger function describing a state of an oscillator, $\psi$ must satisfy the second Schroedinger




equation

$$H\psi = i\hbar\,\frac{\partial\psi}{\partial t}, \tag{1}$$

where $H = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \tfrac{1}{2}kx^2$. Secondly, $\psi$ must not be an eigenfunction of $H$, that is, the inequality

$$H\psi \neq \text{constant} \times \psi \tag{2}$$

must hold, because otherwise $\psi$ would specify a state for which the result of an energy measurement is a certainty, while the energy of our oscillator is definitely uncertain. Further, in view of theorem 1 of §17 and the result of Exercise 14¹⁸, we may take $\psi$ to be normalized:

$$\int \bar\psi\psi\,dx = 1. \tag{3}$$

We next recall that in the classical treatment of the problem we concluded that, for the state of the oscillator in question,

$$\mathrm{av}\,E^j = \tfrac{1}{2}\left(E_I^j + E_{II}^j\right), \quad j = 0, 1, 2, 3, \ldots . \tag{4}$$

The infinite set of equations (4) was obtained essentially from probability considerations and without reference to mechanics, so that we may expect it to hold in quantum mechanics as well as in classical theory. Now, when $\psi$ is normalized, the Schroedinger expectation formula gives $\int \bar\psi H^j\psi\,dx$ for $\mathrm{av}\,E^j$; therefore, if Equations (4) are to hold in the quantum-mechanical case, our $\psi$ must satisfy the infinite set of equations

$$\int \bar\psi H^j\psi\,dx = \tfrac{1}{2}\left(E_I^j + E_{II}^j\right), \quad j = 0, 1, 2, 3, \ldots . \tag{5}$$

The problem of specifying the state of our oscillator by a Schroedinger function has now been stated in mathematical terms: the $\psi$ that we need is a solution of (1), (2), (3), and (5). We shall not attempt at this point to deduce $\psi$ from these conditions, but shall merely verify that a certain function, to be written down presently, satisfies them all.

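Let $\psi_I$ be a properly time-dependent and normalized eigenfunction of $H$ belonging to the eigenvalue $E_I$, so that

$$H\psi_I = E_I\psi_I, \quad H\psi_I = i\hbar\,\frac{\partial\psi_I}{\partial t}, \quad \int \bar\psi_I\psi_I\,dx = 1. \tag{6}$$

The function $\psi_I$ is thus one of the functions listed in 7“; the phases of the members of 7“ will be assumed to be fixed by some suitable convention. Similarly, let $\psi_{II}$ be a properly time-dependent and normalized eigenfunction of $H$ belonging to the eigenvalue $E_{II}$, so that

$$H\psi_{II} = E_{II}\psi_{II}, \quad H\psi_{II} = i\hbar\,\frac{\partial\psi_{II}}{\partial t}, \quad \int \bar\psi_{II}\psi_{II}\,dx = 1; \tag{7}$$

then, in view of the orthogonality of the eigenfunctions of $H$, we have

$$\int \bar\psi_I\psi_{II}\,dx = \int \bar\psi_{II}\psi_I\,dx = 0. \tag{8}$$

Consider now the function

$$\psi = \frac{1}{\sqrt{2}}\left(e^{i\delta_I}\psi_I + e^{i\delta_{II}}\psi_{II}\right), \tag{9}$$

where $\delta_I$ and $\delta_{II}$ are real constants, independent of $x$ and $t$. That (9) satisfies our conditions follows from Equations (6), (7), and (8). First,

$$H\psi = \frac{1}{\sqrt{2}}\left(e^{i\delta_I}H\psi_I + e^{i\delta_{II}}H\psi_{II}\right) = \frac{1}{\sqrt{2}}\left(e^{i\delta_I}\,i\hbar\,\frac{\partial\psi_I}{\partial t} + e^{i\delta_{II}}\,i\hbar\,\frac{\partial\psi_{II}}{\partial t}\right) = i\hbar\,\frac{\partial\psi}{\partial t}, \tag{10}$$

so that (1) holds. Secondly,

$$H\psi = \frac{1}{\sqrt{2}}\,e^{i\delta_I}E_I\psi_I + \frac{1}{\sqrt{2}}\,e^{i\delta_{II}}E_{II}\psi_{II}, \tag{11}$$

so that the inequality (2) holds, since $E_I \neq E_{II}$. Finally,

$$\begin{aligned} \int \bar\psi H^j\psi\,dx &= \tfrac{1}{2}\int \left(e^{-i\delta_I}\bar\psi_I + e^{-i\delta_{II}}\bar\psi_{II}\right) H^j \left(e^{i\delta_I}\psi_I + e^{i\delta_{II}}\psi_{II}\right) dx \\ &= \tfrac{1}{2}\int \left(e^{-i\delta_I}\bar\psi_I + e^{-i\delta_{II}}\bar\psi_{II}\right)\left(e^{i\delta_I}E_I^j\psi_I + e^{i\delta_{II}}E_{II}^j\psi_{II}\right) dx \\ &= \tfrac{1}{2}\int \left(E_I^j\,\bar\psi_I\psi_I + E_{II}^j\,\bar\psi_{II}\psi_{II}\right) dx \\ &= \tfrac{1}{2}\left(E_I^j + E_{II}^j\right), \end{aligned} \tag{12}$$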




and hence our $\psi$ satisfies the infinite set of equations (5); Equation (3) is a special case of (12), obtained by letting $j = 0$.

It will be shown presently that no function other than (9) satisfies (1), (2), (3), and (5), and consequently we adopt (9) as the Schroedinger function describing the state of our oscillator. The Schroedinger function being available, calculations concerning the outcomes of various experiments on the oscillator can be carried out by the routine methods based on the Schroedinger expectation formula 2^\.

The special problem which we have been discussing, and to which we shall return in §22, was taken up by way of introduction, and is not of any particular interest in itself; it does show, however, that physical situations can arise in which the Schroedinger function is not an eigenfunction of the operator $H$, but a linear combination (or a superposition) of such eigenfunctions. If the Schroedinger function for a certain state of a system is a linear combination of Schroedinger functions for some other states of the system, we say that the former state is a superposition of the latter states; for example, the state of the oscillator discussed above is a superposition of two energy states, one of energy $E_I$ and the other of energy $E_{II}$.
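The verification of condition (5) can be replayed with numbers. The sketch below represents a state only by its expansion coefficients on orthonormal energy eigenfunctions and uses $H^j\psi_n = E_n^j\psi_n$; it assumes an equal-weight superposition with arbitrary constant phases on the two terms, and the energy values, phase values, and function names are hypothetical choices for illustration.

```python
import cmath
import math

def energy_moment(coefficients, energies, j):
    """Average of E^j for psi = sum_n c_n psi_n with orthonormal psi_n.

    Orthonormality removes the cross terms, so the integral of
    conj(psi) H^j psi reduces to sum_n |c_n|^2 * E_n^j.
    """
    return sum(abs(c) ** 2 * e ** j for c, e in zip(coefficients, energies))

def coin_toss_state(delta_1, delta_2):
    """Coefficients of psi_I and psi_II: equal weights, arbitrary phases."""
    return [cmath.exp(1j * delta_1) / math.sqrt(2.0),
            cmath.exp(1j * delta_2) / math.sqrt(2.0)]

E_I, E_II = 1.5, 3.5   # hypothetical permitted energies, in units of h*nu_c
for phases in [(0.0, 0.0), (0.7, -2.1)]:
    c = coin_toss_state(*phases)
    print(phases, [energy_moment(c, [E_I, E_II], j) for j in range(4)])
```

For every $j$ the result equals $\tfrac{1}{2}(E_I^j + E_{II}^j)$ whatever the phases, which suggests that the averages of the powers of the energy cannot by themselves fix the phases of the two terms.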


20. Resolution of States

The problem considered above was that of finding the Schroedinger function for a state which is known not to be an energy state, but for which the distribution-in-energy is known. We shall now turn to the converse problem: to find the distribution-in-energy for a state for which the Schroedinger function is known, but is not an eigenfunction of the operator associated with the Hamiltonian of the system. The solution will incidentally explain why the function $\psi$ of §19 must have the form 9¹⁹.

Let us assume then that the state of a linear harmonic oscillator is specified at the instant $t = 0$ by a Schroedinger function $\psi^0$ that is normalized,

$$\int \bar\psi^0\psi^0\,dx = 1, \tag{1}$$

and that is not an eigenfunction of $H$; and let us calculate the probability, to be denoted by $P_n$, that a precise energy measurement to be made at the time $t$ will yield the result $E_n = (n + \tfrac{1}{2})h\nu_c$; according to quantum theory, results not of this form are impossible.

Since we are interested in the results of an experiment to be performed at the time $t$, we must first calculate the Schroedinger function $\psi$ for the time $t$; $\psi^0$ being given, this can be done by the method of §16. We have, in fact,

$$\psi = \sum_n c_n\psi_n, \quad n = 0, 1, 2, \ldots, \tag{2}$$

where $\psi_n$ is a properly time-dependent and normalized eigenfunction of $H$ belonging to the eigenvalue $E_n$, listed in 7^®, and where $c_n = \int \bar\psi_n^0\psi^0\,dx$, $\psi_n^0$ being obtained from $\psi_n$ by letting $t = 0$.

Now, if the respective absolute probabilities of the results $E_0$, $E_1$, $E_2$, and so forth, of an energy measurement to be made on our oscillator at the instant $t$ are $P_0$, $P_1$, $P_2$, and so forth, then, according to the standard statistical formula 8^^, the average of the $k$th power of the energy at the instant $t$ is

$$\sum_n E_n^k P_n, \quad n = 0, 1, 2, 3, \ldots . \tag{3}$$

On the other hand, the Schroedinger function being available, the average of $E^k$ of our oscillator can be calculated by quantum-mechanical means; it is, in fact (since $\psi^0$ is normalized, so is $\psi$; see 18^®),

$$\int \bar\psi H^k\psi\,dx, \tag{4}$$

where $\psi$ is given by (2) and where $H = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \tfrac{1}{2}kx^2$. Both (3) and (4) represent the same number, and consequently the $P$'s that we are seeking must satisfy the infinite set of equations

$$\sum_n E_n^k P_n = \int \bar\psi H^k\psi\,dx \quad \text{for } k = 0, 1, 2, \ldots . \tag{5}$$

Inserting the expansion (2) for $\psi$ into (4), we get¹⁰

$$\int \bar\psi H^k\psi\,dx = \int \Big(\sum_j \bar c_j\bar\psi_j\Big) H^k \Big(\sum_n c_n\psi_n\Big) dx = \int \Big(\sum_j \bar c_j\bar\psi_j\Big)\Big(\sum_n c_n E_n^k\psi_n\Big) dx = \sum_j\sum_n \bar c_j c_n E_n^k \int \bar\psi_j\psi_n\,dx; \tag{6}$$

the integral of $\bar\psi_j\psi_n$ vanishes for $j \neq n$ and equals unity for $j = n$ because of the orthonormality of the $\psi$'s, so that we are left with

$$\int \bar\psi H^k\psi\,dx = \sum_n E_n^k\,|c_n|^2 \tag{7}$$

and (5) becomes

$$\sum_n E_n^k P_n = \sum_n E_n^k\,|c_n|^2 \quad \text{for } k = 0, 1, 2, \ldots . \tag{8}$$

The obvious solution of these equations is

$$P_n = |c_n|^2, \quad n = 0, 1, 2, \ldots, \tag{9}$$

and, in view of the uniqueness theorem of §11, this is the only solution.

¹⁰ In the equations that follow, the summation indices all run through the values 0, 1, 2, 3, ... .

Thus the problem of calculating the distribution-in-energy for a state of the oscillator for which the Schroedinger function is known has been solved: the probability that a precise energy measurement will yield the result $E_n$ when the state is specified by a normalized Schroedinger function $\psi$ equals the square of the modulus of the coefficient of $\psi_n$ in the expansion of $\psi$ in terms of the normalized eigenfunctions of $H$. The process of expanding a given $\psi$ in terms of the eigenfunctions of $H$ is usually called resolution of the given state into energy states.
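The resolution of a state into energy states can be carried out numerically. The sketch below is only an illustration under stated assumptions: lengths are in units of $a$, the normalized oscillator eigenfunctions at $t = 0$ are generated by the standard three-term recurrence for Hermite functions with all phases set to zero, and the test state is a Gaussian displaced by a hypothetical amount $x_0 = 2$; the function names are not from the text.

```python
import math

def hermite_functions(x, n_max):
    """Normalized oscillator eigenfunctions psi_0 .. psi_n_max at x (a = 1)."""
    values = [math.pi ** -0.25 * math.exp(-0.5 * x * x)]
    if n_max >= 1:
        values.append(math.sqrt(2.0) * x * values[0])
    for n in range(1, n_max):
        values.append(math.sqrt(2.0 / (n + 1)) * x * values[n]
                      - math.sqrt(n / (n + 1)) * values[n - 1])
    return values

def energy_probabilities(psi, n_max=30, half_width=15.0, h=0.005):
    """P_n = |c_n|^2 with c_n = integral of psi_n(x) * psi(x) dx (psi_n real)."""
    steps = int(round(2.0 * half_width / h))
    c = [0.0] * (n_max + 1)
    for i in range(steps + 1):
        x = -half_width + i * h
        amplitude = psi(x)
        for n, value in enumerate(hermite_functions(x, n_max)):
            c[n] += value * amplitude * h
    return [abs(cn) ** 2 for cn in c]

def displaced_gaussian(x, x0=2.0):
    """Normalized Gaussian of the normal-state width, centered at x0."""
    return math.pi ** -0.25 * math.exp(-0.5 * (x - x0) ** 2)

probs = energy_probabilities(displaced_gaussian)
print(sum(probs))
print(probs[:5])
```

The probabilities add up to unity to within the integration error, as they must for a normalized state. For this particular state they also follow the Poisson form $e^{-\lambda}\lambda^n/n!$ with $\lambda = x_0^2/2a^2$.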

Summary. The physical information regarding the state of an oscillator may imply that its energy is a certainty; in this case, as was shown in §18, the state is to be specified by an eigenfunction of $H$; on the other hand, the physical information may imply that the energy is not a certainty (an illustration is the oscillator of §19), and in that case a superposition of eigenfunctions of $H$ is necessary to specify the state. Conversely, if the state is specified by an eigenfunction of $H$, then, in view of the results of §17, the energy is a certainty; but if the state is specified by a $\psi$-function that is not an eigenfunction of $H$, then the result of measuring the energy is not a certainty, and the probability of getting any particular result can be computed by means of the theorem given in italics above.

Exercises 

1. The function 9¹⁹ was written down without inquiry as to how it could be obtained. Show that the form 9¹⁹ follows from the theorem in italics above.

2. We instruct a person to toss a well-balanced die and to adjust the energy of an oscillator to be certainly $E_I$ if the score is 1, to be certainly $E_{II}$ if the score is 2 or 3, and to be certainly $E_{III}$ if the score is 4, 5, or 6; he,




in our absence, does as instructed and then returns the oscillator to us without comment. Set up the normalized Schroedinger function representing the state of the oscillator as defined by the information available to us, and enumerate the arbitrary features of this function.

3. Show that the condition 5^® includes the condition 2^*.

4. How does the choice of the phases of the $\psi_n$'s used in (2) affect the $P$'s in (9)? Compare Exercise 1^®.

5. How does the distribution-in-$E$ given by (9) depend on $t$? Is the answer one that is to be expected from intuitive physical considerations?

6. Show that the $P$'s given by (9) satisfy

7. Show that the result of Exercise 3^® follows at once from (9).


21. Probability Packets 


We shall now consider from the quantum-mechanical viewpoint a special case of the problem treated classically in Exercise 1^^, where we investigated the oscillator on the assumption that the precise values of $x$ and $p$ at $t = 0$ are unknown, but that the respective distributions in $x$ and in $p$ at $t = 0$ are Gaussian.

Were we to consider precisely the same problem from the quantum-mechanical standpoint, we would have to investigate first of all how the available information is to be summarized by means of a Schroedinger function; this we are not prepared to do, and we shall therefore confine ourselves to a rather special case, chosen principally on account of its mathematical simplicity. We shall, in fact, assume outright that the state of the oscillator is specified at $t = 0$ by the Schroedinger function

$$\psi^0 = \frac{1}{\sqrt{a\sqrt{\pi}}}\,e^{-(x - x_0)^2/2a^2}, \tag{1}$$

where Xo 0, aad where a is the constant h^/km given in 9“. 
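The distribution-in-$x$ of our oscillator at $t = 0$ is then

$$P^0(x) = \frac{1}{a\sqrt{\pi}}\,e^{-(x - x_0)^2/a^2}. \tag{2}$$

Now, the average of $p^s$ at $t = 0$ is

$$\mathrm{av}\,p^s = \frac{1}{a\sqrt{\pi}}\int e^{-(x - x_0)^2/2a^2}\left(-i\hbar\frac{\partial}{\partial x}\right)^{s} e^{-(x - x_0)^2/2a^2}\,dx. \tag{3}$$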




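The change of variable from $x$ to $y = x - x_0$ transforms the right side of (3) into

$$\frac{1}{a\sqrt{\pi}}\int e^{-y^2/2a^2}\left(-i\hbar\frac{\partial}{\partial y}\right)^{s} e^{-y^2/2a^2}\,dy. \tag{4}$$

This integral is precisely the same as 21¹⁸, so that the average of $p^s$ at $t = 0$ is, for any $s$, the same as that for the normal state of the oscillator; consequently the distribution-in-$p$ of our oscillator is, at $t = 0$, the same as that for an oscillator in the normal state, and in view of 33¹⁸ we have:

$$P^0(p) = \frac{1}{b\sqrt{\pi}}\,e^{-p^2/b^2}, \tag{5}$$

where $b = a\sqrt{km}$.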

Comparison of (2) and (6) with and 6^^ shows that, having 
started with the $\psi$-function (1), we are treating essentially the
problem of Exercise 1^^, except that instead of all four quantities
$x_0$, $\Delta_x$, $p_0$, and $\Delta_p$, only one, namely, $x_0$, is left arbitrary, while
the others are assigned the following values:

$$p_0 = 0, \qquad \Delta_x = \frac{a}{\sqrt{2}}, \qquad \Delta_p = \frac{\hbar}{a\sqrt{2}}. \tag{6}$$

We note further that $\Delta_p^2 = \hbar^2/2a^2 = \tfrac{1}{2}\hbar\sqrt{km} = \tfrac{1}{2}\hbar m\omega = km\,\Delta_x^2$, so
that the restriction of Exercise 3** holds. Consequently, were we
to treat classically the problem of an oscillator whose distributions
in x and in p at t = 0 are (2) and (5), we would find them to be the
packets

(20“)
$$P(x) = \frac{1}{a\sqrt{\pi}}\, e^{-\frac{(x - x_0\cos\omega t)^2}{a^2}} \tag{7}$$

and

(21^^)
$$P(p) = \frac{1}{m\omega a\sqrt{\pi}}\, e^{-\frac{(p + x_0 m\omega\sin\omega t)^2}{(m\omega a)^2}}, \tag{8}$$

where $\omega = \sqrt{k/m}$.



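In order to find the corresponding quantum-mechanical distributions,
we proceed to calculate the Schroedinger function for
the time t, that is, to find that solution of the equation

$$H\psi = i\hbar\,\frac{\partial\psi}{\partial t}, \tag{9}$$

where $H = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \frac{1}{2}kx^2$, which equals our $\psi^0$ at t = 0.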





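To compute the required solution of (9) by the systematic
method of §16, set all the $\gamma$'s in 7“ equal to zero,¹¹ in which
case the normalized eigenfunctions of H at t = 0 are

$$\psi_n^0 = N_n H_n\!\left(\frac{x}{a}\right) e^{-\frac{x^2}{2a^2}}, \qquad n = 0, 1, 2, \ldots, \tag{10}$$

where $N_n = (a\,2^n n!\sqrt{\pi})^{-1/2}$, and expand our $\psi^0$ in terms of the set (10):

$$\psi^0 = \sum_n c_n \psi_n^0 = \sum_n c_n N_n H_n\!\left(\frac{x}{a}\right) e^{-\frac{x^2}{2a^2}}, \qquad n = 0, 1, 2, \ldots. \tag{11}$$

The step corresponding to that from 13“ to 14“ then yields the
desired solution:

$$\psi = \sum_n c_n \psi_n^0\, e^{-iE_n t/\hbar}, \qquad n = 0, 1, 2, \ldots, \tag{12}$$

where $E_n = (n + \tfrac{1}{2})h\nu_c$.

To get $\psi$ explicitly, we must evaluate the c's in (11). According
to 21®, we have $c_n = \int \bar\psi_n^0 \psi^0\, dx$; but, in view of the special form of
our $\psi^0$ and the relation 43^, actual integration in evaluating the
c's can be avoided as follows. We write 43^ as

$$e^{-s^2 + 2s\frac{x}{a}} = \sum_n \frac{s^n}{n!}\, H_n\!\left(\frac{x}{a}\right) \tag{13}$$

and our $\psi^0$ as

$$\psi^0 = (a\sqrt{\pi})^{-1/2}\, e^{-\frac{x^2}{2a^2} - \frac{x_0^2}{4a^2}}\; e^{-\frac{x_0^2}{4a^2} + \frac{x x_0}{a^2}}; \tag{14}$$

we then set $s = x_0/2a$ in (13), making its left side identical with
the second exponential in (14), and substitute for this exponential
the right side of (13); the result, when rearranged, is

$$\psi^0 = \sum_n \frac{1}{\sqrt{n!}} \left(\frac{x_0}{a\sqrt{2}}\right)^{n} e^{-\frac{x_0^2}{4a^2}}\, N_n H_n\!\left(\frac{x}{a}\right) e^{-\frac{x^2}{2a^2}}, \tag{15}$$

and comparison with (11) shows that

$$c_n = \frac{1}{\sqrt{n!}} \left(\frac{x_0}{a\sqrt{2}}\right)^{n} e^{-\frac{x_0^2}{4a^2}}. \tag{16}$$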
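The shortcut through the generating function can be checked against the integral $c_n = \int \bar\psi_n^0 \psi^0\, dx$ that it replaces. The following is a minimal numerical sketch, not part of the original text; it assumes units in which $a = 1$, a packet centre $x_0$ of a few units at most, and a plain trapezoidal rule. All function names are illustrative.

```python
import math

def hermite(n, xi):
    """Physicists' Hermite polynomial H_n(xi), built up by the standard recurrence."""
    h_prev, h = 1.0, 2.0 * xi              # H_0 and H_1
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, 2.0 * xi * h - 2.0 * k * h_prev   # H_{k+1} = 2 xi H_k - 2k H_{k-1}
    return h

def eigenfunction(n, x, a=1.0):
    """Normalized oscillator eigenfunction of (10) at t = 0."""
    norm = (a * 2.0 ** n * math.factorial(n) * math.sqrt(math.pi)) ** -0.5
    return norm * hermite(n, x / a) * math.exp(-x * x / (2.0 * a * a))

def initial_packet(x, x0, a=1.0):
    """The displaced Gaussian (1)."""
    return (a * math.sqrt(math.pi)) ** -0.5 * math.exp(-(x - x0) ** 2 / (2.0 * a * a))

def c_by_integration(n, x0, a=1.0, half_width=12.0, steps=4000):
    """Expansion coefficient c_n by trapezoidal integration over [-half_width*a, half_width*a]."""
    lo = -half_width * a
    h = 2.0 * half_width * a / steps
    total = 0.0
    for i in range(steps + 1):
        x = lo + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * eigenfunction(n, x, a) * initial_packet(x, x0, a)
    return total * h

def c_closed_form(n, x0, a=1.0):
    """The closed form (16)."""
    return (x0 / (a * math.sqrt(2.0))) ** n / math.sqrt(math.factorial(n)) \
        * math.exp(-x0 * x0 / (4.0 * a * a))

if __name__ == "__main__":
    for n in range(6):
        print(n, c_by_integration(n, 1.5), c_closed_form(n, 1.5))
```

The two columns printed by the script should agree to many decimal places, since both integrands are smooth and decay rapidly.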


¹¹ This leads to a superficial simplification in the work and does not affect
the results; see Exercises 1^® and 4*°.





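The explicit form of (12) is consequently

$$\psi = \sum_n \frac{1}{\sqrt{n!}} \left(\frac{x_0}{a\sqrt{2}}\right)^{n} e^{-\frac{x_0^2}{4a^2}}\, N_n H_n\!\left(\frac{x}{a}\right) e^{-\frac{x^2}{2a^2}}\, e^{-iE_n t/\hbar}. \tag{17}$$

The problem of finding that solution of (9) which reduces to
our $\psi^0$ at t = 0 has now been solved explicitly: the desired solution
is (17). The infinite series in (17) can, however, be readily
summed. Since $E_n = (n + \tfrac{1}{2})\hbar\omega$, where $\omega = \sqrt{k/m}$, (17) can
be written as

$$\psi = (a\sqrt{\pi})^{-1/2}\, e^{-\frac{x^2}{2a^2} - \frac{x_0^2}{4a^2} - \frac{i\omega t}{2}} \sum_n \frac{1}{n!} \left(\frac{x_0}{2a}\, e^{-i\omega t}\right)^{n} H_n\!\left(\frac{x}{a}\right); \tag{18}$$

the summation in (18) is of the same form as that in (13) and can
be eliminated by means of (13) with $s = (x_0/2a)\exp(-i\omega t)$;
the result, when rearranged, is

$$\psi = (a\sqrt{\pi})^{-1/2} \exp\left[-\frac{x^2}{2a^2} + \frac{x x_0}{a^2}\, e^{-i\omega t} - \frac{x_0^2}{4a^2}\left(1 + e^{-2i\omega t}\right) - \frac{i\omega t}{2}\right]. \tag{19}$$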


Lest the mathematical steps that resulted in (19) confuse the reader as
to what the main object of our calculation has been, we shall repeat that,
being given that at t = 0 the Schroedinger function describing the state of
our oscillator is (1), we computed the Schroedinger function $\psi$ describing
the state at the time t. To do this, we had to find that solution of (9)
which reduces to our $\psi^0$ at t = 0, and this solution, when simplified in
appearance, turned out to be (19). The procedure used in finding $\psi$ is
really immaterial; were we able to guess that the solution of (9) which
reduces to $\psi^0$ at t = 0 is (18), our aim would have been accomplished. In
fact, we used the systematic procedure of §16, and, besides, the special form
of our $\psi^0$ and the relation 43^ enabled us to carry out the calculations with
relatively little labor and to obtain the solution in a closed form rather than
in the form of an infinite series.


The Schroedinger function for the time t now being known, we
are ready to make calculations concerning the behavior of our
oscillator at any time. We can, for example, compute the
distribution-in-x of the particle at the instant t; multiplying (19) by
its complex conjugate, we get

$$P(x) = \frac{1}{a\sqrt{\pi}}\, e^{-\frac{(x - x_0\cos\omega t)^2}{a^2}}. \tag{20}$$





This result is identical with (7), so that, for the special case under 
consideration, classical mechanics and quantum theory agree 
insofar as the probability packet describing the position of the 
oscillator at any time is concerned, a conclusion that appears 
remarkable in view of the wide differences between the classical 
and quantum-mechanical concepts and methods employed in the 
study of this problem. Note that in investigating the behavior
of the x-packet classically we used the simultaneous
distribution-in-x-and-in-p of the oscillator, while in the quantum-mechanical
treatment no recourse to such a simultaneous distribution was
made.
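The agreement between (19) and the packet (20) can also be confirmed numerically, together with the claim that (19) satisfies (9). The sketch below is an illustration added here, not the author's procedure; it assumes units with $\hbar = m = k = 1$ (so that $\omega = 1$ and $a = 1$) and uses simple central differences for the derivatives.

```python
import cmath
import math

HBAR = M = K = 1.0                      # assumed units for this sketch
OMEGA = math.sqrt(K / M)
A = (HBAR ** 2 / (K * M)) ** 0.25       # the constant a

def psi(x, t, x0):
    """Closed-form Schroedinger function (19)."""
    exponent = (-x * x / (2 * A * A)
                + (x * x0 / (A * A)) * cmath.exp(-1j * OMEGA * t)
                - (x0 * x0 / (4 * A * A)) * (1 + cmath.exp(-2j * OMEGA * t))
                - 0.5j * OMEGA * t)
    return (A * math.sqrt(math.pi)) ** -0.5 * cmath.exp(exponent)

def packet(x, t, x0):
    """Distribution-in-x (20), identical in form with the classical packet (7)."""
    return math.exp(-((x - x0 * math.cos(OMEGA * t)) / A) ** 2) / (A * math.sqrt(math.pi))

def schroedinger_residual(x, t, x0, h=1e-3):
    """H psi - i hbar d(psi)/dt, with derivatives taken by central differences."""
    d2x = (psi(x + h, t, x0) - 2 * psi(x, t, x0) + psi(x - h, t, x0)) / h ** 2
    dt = (psi(x, t + h, x0) - psi(x, t - h, x0)) / (2 * h)
    h_psi = -HBAR ** 2 / (2 * M) * d2x + 0.5 * K * x * x * psi(x, t, x0)
    return h_psi - 1j * HBAR * dt

if __name__ == "__main__":
    for x, t in [(0.3, 0.7), (-1.1, 2.4)]:
        print(abs(psi(x, t, 1.5)) ** 2, packet(x, t, 1.5),
              abs(schroedinger_residual(x, t, 1.5)))
```

The residual is limited only by the finite-difference step, so it should be small but not exactly zero.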

The quantum-mechanical distribution-in-p for the state (19) 
also agrees with the corresponding classical distribution, namely, 
(8) ; the proof will be postponed until we develop in §27 a direct 
method for computing distributions-in-p from Schroedinger 
functions. 

Although (19) presents $\psi$ in a simpler form than does (17), the
expression (17) is of greater interest in calculations concerning the
energy of our oscillator, since (17) displays explicitly the expansion
coefficients $c_n$. If $x_0 \neq 0$, the Schroedinger function (1) is
not an eigenfunction of H, and consequently the energy of our
oscillator is not a certainty; and, according to the theorem of
§20, the probability that a precise energy measurement made at
any time will yield the result $E_n = (n + \tfrac{1}{2})h\nu_c$ is

$$|c_n|^2 = \frac{(x_0^2/2a^2)^{n}}{n!}\, e^{-\frac{x_0^2}{2a^2}}. \tag{21}$$
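The probabilities (21) form a Poisson distribution in n with mean $x_0^2/2a^2$, which makes them easy to tabulate. The following sketch is an added illustration under the assumption that energies are measured in units of $\hbar\omega$; the function names are hypothetical.

```python
import math

def energy_probabilities(x0_over_a, n_max=200):
    """Probabilities (21) for n = 0..n_max, generated by the ratio of successive terms."""
    mu = 0.5 * x0_over_a ** 2           # x0^2 / 2a^2
    probs = []
    term = math.exp(-mu)                # n = 0 term
    for n in range(n_max + 1):
        probs.append(term)
        term *= mu / (n + 1)            # |c_{n+1}|^2 / |c_n|^2 = mu / (n + 1)
    return probs

def mean_energy(x0_over_a, n_max=200):
    """Average energy in units of hbar*omega, summed term by term."""
    probs = energy_probabilities(x0_over_a, n_max)
    return sum((n + 0.5) * p for n, p in enumerate(probs))

if __name__ == "__main__":
    ratio = 3.0                          # x0/a, an arbitrary illustrative value
    print(sum(energy_probabilities(ratio)))   # should be very close to 1
    print(mean_energy(ratio))                 # compare with 1/2 + x0^2/2a^2
```

In these units $\tfrac{1}{2}kx_0^2$ equals $x_0^2/2a^2$, so the second number can be compared directly with the classical potential energy at the turning point plus the zero-point term.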


Exercises 

1. Verify by direct substitution that (19) satisfies (9) and that it
reduces to (1) at t = 0.

2. Verify by a straightforward summation that (as is to be expected from
the general result of Exercise 6®®) the probabilities (21) add up to unity.

3. Assume that $x_0/a$ in (1) is large compared to unity and show that then
the most probable result of an energy measurement is that value of
$E_n = (n + \tfrac{1}{2})h\nu_c$ which lies nearest to $\tfrac{1}{2}kx_0^2$. Show also, using the first term of
Stirling's formula for factorials of large integers [$n! \approx \sqrt{2\pi n}\, n^n e^{-n}(1 + n^{-1}/12 + n^{-2}/288 - \cdots)$],
that the probability of the most probable result
is approximately $\pi^{-1/2} a/x_0$. Can two or more possible values of the energy
of our oscillator be equally probable?

4. Show that the average of the energy of an oscillator in the state (19)
is $\tfrac{1}{2}h\nu_c + \tfrac{1}{2}kx_0^2$; note that this result is greater than $E_0 = \tfrac{1}{2}h\nu_c$, in agreement
with the conclusion of Exercise 17^^.





5. Explain why the following argument is fallacious: when $t = t' = \pi/2\omega$,
the distribution-in-x (20) is identical with the distribution-in-x
for the normal state of the oscillator; hence the oscillator whose state is
specified at t = 0 by (1) is in the normal state at $t = t'$, and a precise energy
measurement made at the instant $t'$ is certain to yield the result $E_0 = \tfrac{1}{2}h\nu_c$.

6. Consider the state obtained by letting $x_0 = 0$ in (1); in particular,
contrast the conclusions concerning the energy with the classical results of
Exercise 4}^.

7. Let the state of an oscillator be specified at t = 0 by the Schroedinger
function $(d\sqrt{\pi})^{-1/2}\exp(-x^2/2d^2)$, where $d^4 \neq \hbar^2/km$, that is, $d \neq a$. Show
that then the values $E_n = (n + \tfrac{1}{2})h\nu_c$ with n an odd integer cannot be results
of a precise energy measurement.

8. Show that the classical theorem of Exercise 5^^ holds in quantum
theory.

22. Coherent and Incoherent Superpositions of States 
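Let us return for a moment to the state

(9")
$$\psi = \frac{e^{i\delta_{I}}}{\sqrt{2}}\,\psi_{I} + \frac{e^{i\delta_{II}}}{\sqrt{2}}\,\psi_{II}. \tag{1}$$

Here $\psi_{I}$ and $\psi_{II}$ are free from any arbitrariness, but the real
constants $\delta_{I}$ and $\delta_{II}$ are both arbitrary; in particular, the constant
$\delta_{I} - \delta_{II}$, the relative phase of the energy states $\psi_{I}$ and $\psi_{II}$, is
arbitrary. A superposition involving arbitrary relative phases is
called an incoherent superposition. Incoherent superposition
arises in connection with states specified by sets of conditions less
stringent than maximal sets.

In dealing with incoherent superpositions, it is sometimes
necessary to average over the arbitrary relative phases when
computing theoretical data for comparison with experiment; to
illustrate, we shall consider the distribution-in-x for the state (1).
Recalling that $\psi_{I} = f_{I}(x)e^{-iE_{I}t/\hbar}$ and $\psi_{II} = f_{II}(x)e^{-iE_{II}t/\hbar}$, where
$f_{I}$ and $f_{II}$ are free from any arbitrariness, we get

$$\bar\psi\psi = \tfrac{1}{2}\left(\bar\psi_{I}\psi_{I} + \bar\psi_{II}\psi_{II}\right) + \tfrac{1}{2}\left[\bar f_{I} f_{II} \exp i\left(\frac{(E_{I} - E_{II})t}{\hbar} + \delta_{II} - \delta_{I}\right) + f_{I}\bar f_{II} \exp i\left(\frac{(E_{II} - E_{I})t}{\hbar} + \delta_{I} - \delta_{II}\right)\right]. \tag{2}$$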

The first term in (2) is independent of t, but the terms in brackets
involve t explicitly; yet the question "What is the distribution-in-x
when our laboratory clock reads t?" cannot be answered with




certainty because the relative phase $\delta_{I} - \delta_{II}$ is involved in the
bracketed terms in essentially the same way as t, and because the
physical information that led us to adopt (1) as the proper Schroedinger
function provides no clue as to what the value of $\delta_{I} - \delta_{II}$
should be. The dependence of (2) on t is therefore only formal.
The best we can do in estimating the distribution-in-x at the time
t is to average the bracketed terms over all possible values of
$\delta_{I} - \delta_{II}$, regarded as equally likely. Now, the average of
$\exp i(\delta_{I} - \delta_{II})$ calculated in this manner is just the average of all
complex numbers of modulus unity (that is, zero), so that, upon
averaging, the bracketed terms in (2) vanish. The estimate of
the distribution-in-x at the time t is consequently

$$P(x) = \tfrac{1}{2}\left(\bar\psi_{I}\psi_{I} + \bar\psi_{II}\psi_{II}\right) \tag{3}$$

and is independent of t. Note that (3) is just what we would get
by using the quantum-mechanical $P_{I}(x)$ and $P_{II}(x)$ in the classical
formula for P(x) given in Exercise
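The phase average used above can be illustrated with a few lines of arithmetic. This is a sketch added for illustration only; it assumes real amplitudes $f_{I}$, $f_{II}$ at a single point x, arbitrary illustrative energies, and a uniform grid of phases standing in for "all possible values regarded as equally likely".

```python
import cmath
import math

def phase_average(func, samples=360):
    """Average func(delta) over equally spaced phases delta in [0, 2*pi)."""
    return sum(func(2 * math.pi * k / samples) for k in range(samples)) / samples

def bracketed_term(delta, t, f1=0.8, f2=0.6, e1=0.5, e2=1.5, hbar=1.0):
    """Cross term of (2) at one point x, for real f_I = f1 and f_II = f2.

    delta stands for the relative phase delta_I - delta_II; all numerical
    values are illustrative assumptions.
    """
    phase = (e1 - e2) * t / hbar - delta
    pair = f1 * f2 * cmath.exp(1j * phase) + f1 * f2 * cmath.exp(-1j * phase)
    return 0.5 * pair.real

if __name__ == "__main__":
    print(abs(phase_average(lambda d: cmath.exp(1j * d))))      # average of unit-modulus numbers
    print(bracketed_term(0.3, 0.7))                             # one fixed phase: nonzero
    print(phase_average(lambda d: bracketed_term(d, 0.7)))      # averaged over phases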

Superpositions in which the relative phases of the
component states are fixed are called coherent; for a state that is a
coherent superposition of energy states of different energies the
distribution-in-x is necessarily nonstationary. Further remarks
on coherent superposition will be made in §27.

Exercise 

1. Discuss the distribution-in-x of the oscillator of Exercise

23. Heisenberg Inequalities 


In the preceding sections we illustrated how the physical 
properties of an oscillator can be investigated from the standpoint 
of quantum mechanics when it is known that the energy of the 
oscillator is a certainty, or when the information regarding its 
state refers to the values of x and p at a certain instant of time 
but does not provide the initial conditions of the motion with 
precision ; and we compared the quantum-mechanical results with 
those given under analogous circumstances by the classical theory. 
We shall finally consider the quantum-mechanical status of the 
problem of the motion of the oscillator when the information 
regarding its state provides the precise values $x_0$ and $p_0$ of the
coordinate x and the momentum p at the instant $t_0$; and we shall
find that, in quantum mechanics, this fundamental problem of the




classical theory of the oscillator is divested of physical content. 
Our arguments will not be restricted to the oscillator and will 
apply to any one-dimensional system. 

We shall begin by assuming that the following converse of
theorem 3 of §17 is true: If the result of a precise measurement of
the dynamical variable $\alpha$ is certain to be the number a, then the
$\psi$-function describing the state of the system is an eigenfunction of the
operator $\alpha$ belonging to the eigenvalue a. Now, the information
that the values of x and p at the instant $t_0$ are precisely $x_0$ and $p_0$
means that a precise simultaneous measurement of x and p at
this instant is certain to yield the results $x_0$ and $p_0$. In the light
of the theorem stated above, the $\psi$-function describing the state of
the system at $t = t_0$ must therefore be a simultaneous eigenfunction
of the operators associated with the dynamical variables x
and p; but according to Exercise 5® these operators do not have
eigenfunctions in common because their commutator is a
nonvanishing constant. Consequently, the statement that the
position and momentum of a particle are known simultaneously
with unlimited precision cannot be translated into the mathematical
terms of quantum mechanics, and, if we are to regard quantum
mechanics as a good physical theory, no physical content can be
ascribed to this statement.

The general argument just given relies on the theorem stated
above in italics, a theorem which we have not proved.¹² We shall
therefore present an alternative detailed argument, which the
reader can verify step by step, and which, incidentally, shows how
near we can come, so to speak, to a precise simultaneous specification
of x and p in quantum theory.

¹² We have, however, proved special cases in §18.

Let a state of an arbitrary one-dimensional system at an
arbitrary instant of time be specified by a normalized Schroedinger
function $\psi$, so that at this instant

$$\operatorname{av} x = \int \bar\psi x \psi\, dx, \qquad \operatorname{av} x^2 = \int \bar\psi x^2 \psi\, dx, \tag{1}$$

$$\operatorname{av} p = -i\hbar \int \bar\psi \psi'\, dx, \qquad \operatorname{av} p^2 = -\hbar^2 \int \bar\psi \psi''\, dx. \tag{2}$$

We assume that all the integrals above exist for the $\psi$ in question,
so that each quantity enumerated above has a definite finite
numerical value; integration by parts then yields the following
relations:



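$$\int \bar\psi'\psi'\, dx = -\int \bar\psi\psi''\, dx, \tag{3}$$

$$\int \bar\psi'\psi\, dx = -\int \bar\psi\psi'\, dx, \tag{4}$$

$$\int x\left(\bar\psi\psi' + \bar\psi'\psi\right) dx = -\int \bar\psi\psi\, dx. \tag{5}$$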


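Our proof will consist in writing down the obvious inequality
(6) and deducing from it the inequality (13) that has an immediate
physical significance;¹³ more direct proofs of (13) are less
elementary. We begin with the inequality

$$\left|\psi' + (\alpha x + \beta + i\delta)\psi\right|^2 \geq 0, \tag{6}$$

where $\psi$ is the Schroedinger function in question, and where $\alpha$, $\beta$,
and $\delta$ are real constants that for the present we leave arbitrary.
We then rewrite (6) as

$$\bar\psi'\psi' + \alpha x\left(\bar\psi\psi' + \bar\psi'\psi\right) + \beta\left(\bar\psi\psi' + \bar\psi'\psi\right) + i\delta\left(\bar\psi'\psi - \bar\psi\psi'\right) + \left(\alpha^2x^2 + 2\alpha\beta x + \beta^2 + \delta^2\right)\bar\psi\psi \geq 0, \tag{7}$$

integrate (7), and, using (3), (4), and (5), write the result as

$$-\int \bar\psi\psi''\, dx - \alpha\int \bar\psi\psi\, dx - 2i\delta\int \bar\psi\psi'\, dx + \alpha^2\int \bar\psi x^2\psi\, dx + 2\alpha\beta\int \bar\psi x\psi\, dx + \left(\beta^2 + \delta^2\right)\int \bar\psi\psi\, dx \geq 0. \tag{8}$$

Next we use (1) and (2) to reduce (8) to

$$\hbar^{-2}\operatorname{av} p^2 - \alpha + 2\delta\hbar^{-1}\operatorname{av} p + \alpha^2\operatorname{av} x^2 + 2\alpha\beta\operatorname{av} x + \beta^2 + \delta^2 \geq 0, \tag{9}$$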


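and finally assign the following specific numerical values to $\alpha$, $\beta$,
and $\delta$:

$$\alpha = \frac{1}{2\Delta_x^2}, \qquad \beta = -\frac{\operatorname{av} x}{2\Delta_x^2}, \qquad \delta = -\frac{\operatorname{av} p}{\hbar}. \tag{10}$$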


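Equation (9) then becomes

$$\frac{1}{\hbar^2}\left(\operatorname{av} p^2 - \operatorname{av}^2 p\right) - \frac{1}{2\Delta_x^2} + \frac{1}{4\Delta_x^4}\left(\operatorname{av} x^2 - \operatorname{av}^2 x\right) \geq 0, \tag{11}$$

that is,

$$\frac{\Delta_p^2}{\hbar^2} - \frac{1}{4\Delta_x^2} \geq 0, \tag{12}$$

and we conclude that

$$\Delta_x\Delta_p \geq \frac{h}{4\pi}. \tag{13}$$

¹³ This proof is a modification of that given by W. Heisenberg in his
Physical Principles of the Quantum Theory, page 18. The University of
Chicago Press, 1930.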


Thus the quantity $\Delta_x\Delta_p$ (the product of the respective inexactitudes
of the coordinate x and the momentum p) for any state of an
arbitrary one-dimensional system cannot be smaller than the universal
constant $h/4\pi$. Therefore, a verbal description of a state cannot
be summarized by means of a Schroedinger function whenever
this description implies that x and p are known simultaneously
with a precision exceeding that allowed by (13), and, according to
quantum mechanics, a state so described cannot be realized
physically.
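The inequality (13) can be tried out numerically on any well-behaved trial function. The sketch below is an added illustration, not part of the proof; it assumes units with $\hbar = 1$ (so the bound $h/4\pi$ equals 1/2), a function that is negligible at the ends of the grid, and central differences for $\psi'$. It evaluates av p and av p² through (2) and (3), so no second derivative is needed.

```python
import cmath
import math

HBAR = 1.0   # assumed units; the bound h/4pi is then 0.5

def uncertainty_product(psi, lo=-12.0, hi=12.0, steps=4800):
    """Delta_x * Delta_p for a (not necessarily normalized) trial function psi(x)."""
    h = (hi - lo) / steps
    xs = [lo + i * h for i in range(steps + 1)]
    vals = [complex(psi(x)) for x in xs]
    norm = av_x = av_x2 = av_p = av_p2 = 0.0
    for i in range(1, steps):
        dens = abs(vals[i]) ** 2
        dpsi = (vals[i + 1] - vals[i - 1]) / (2 * h)         # psi' by central difference
        norm += dens
        av_x += xs[i] * dens
        av_x2 += xs[i] ** 2 * dens
        av_p += HBAR * (vals[i].conjugate() * dpsi).imag      # -i hbar * integral(conj(psi) psi')
        av_p2 += HBAR ** 2 * abs(dpsi) ** 2                   # uses relation (3)
    av_x, av_x2, av_p, av_p2 = (q / norm for q in (av_x, av_x2, av_p, av_p2))
    delta_x = math.sqrt(av_x2 - av_x ** 2)
    delta_p = math.sqrt(av_p2 - av_p ** 2)
    return delta_x * delta_p

if __name__ == "__main__":
    print(uncertainty_product(lambda x: math.exp(-x * x / 2)))             # Gaussian
    print(uncertainty_product(lambda x: cmath.exp(-x * x / 2 + 2j * x)))   # Gaussian with phase
    print(uncertainty_product(lambda x: x * math.exp(-x * x / 2)))         # first excited state
    print(uncertainty_product(lambda x: 1.0 / math.cosh(x)))               # non-Gaussian
```

The Gaussians should sit at the bound, while the other trial functions should lie above it.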

The proof of (13) given above is readily extended from normalized
to quadratically integrable $\psi$'s, provided the integrals in
(1) and (2) exist. Schroedinger functions which are not
quadratically integrable, and ones for which some of the integrals in
(1) and (2) do not exist, are of course of no interest in the present
connection.

Applying our results to the case of an oscillator, we conclude
that the problem of the motion of an oscillator whose position and
momentum at $t_0$ are precisely $x_0$ and $p_0$ cannot be properly
stated in quantum theory, and consequently, from the standpoint
of this theory, has no physical significance.

The inequality (13), a Heisenberg inequality, is an example
of the limitations that are imposed by the quantum theory on the
precision with which the values of canonically conjugate pairs of
dynamical variables can be known simultaneously. Among the
Heisenberg inequalities referring to three-dimensional motion
of a particle are

$$\Delta_x\Delta_{p_x} \geq \frac{h}{4\pi}, \qquad \Delta_y\Delta_{p_y} \geq \frac{h}{4\pi}, \qquad \Delta_z\Delta_{p_z} \geq \frac{h}{4\pi}. \tag{14}$$


Exercises 

1. Verify that if $\Delta_x = 0$ the inequality (9) can be converted into

$$\hbar^{-2}\Delta_p^2 + (\alpha\operatorname{av} x + \beta)^2 - \alpha \geq 0, \tag{15}$$





and that hence $\Delta_p$ would have to be greater than any preassigned number.
Thus the vanishing of $\Delta_x$ is inconsistent with the finiteness of all integrals
in (1) and (2), so that the substitutions (10) are permissible whenever these
integrals are finite.

2. Deduce from (9) that $\Delta_p$ cannot vanish if all integrals in (1) and (2)
are finite.

3. Verify that the quantum-mechanical values of $\Delta_x\Delta_p$ for the various
specific states of an oscillator studied in this chapter satisfy (13).

4. Consider the energy states of an oscillator and show that the
statement, "The more accurately one knows the position of a particle, the less
accurately one knows its momentum," is not always true, and consequently
should not be made unless properly qualified.

24. The Heisenberg Principle

The fact that the Heisenberg inequalities follow directly from 
the assumptions of quantum mechanics shows that the quantum 
theory satisfies the requirements of a fundamental physical prin- 
ciple enunciated by Heisenberg, requirements that, in the light 
of the present-day knowledge, must be fulfilled by any theory in 
order that it may hold promise of being capable of correlating 
physical phenomena without the introduction of irrelevancies. 
The Heisenberg principle has to do with the simultaneous experi- 
mental measurement of any two canonically conjugate dynamical 
variables; if, for definiteness, these variables are taken to be the 
coordinate x and its conjugate momentum p, this principle can 
be stated as follows: it is impossible to devise an experimental 
procedure for the measurement of x and p that would yield
simultaneously absolutely precise values of x and p: the information
provided by any experiment regarding the simultaneous values of
x and p is always inaccurate to such an extent that $\Delta_x\Delta_p \geq h/4\pi$.

To quote from §12: “Any particular apparatus used for the
measurement of x, say, will of course have its own limits of
precision, but, if greater precision is required, another, more delicate,
apparatus can in principle be constructed. Classical mechanics 
implies that the improvement of the apparatus for measuring x 
can be carried on indefinitely so that the precision will exceed 
any preassigned standard, however fine. Similarly, the precision 
of apparatus for the measurement of p, or of apparatus for the 
simultaneous measurement of x and p, can, according to the 
classical view, be improved without limit.” The words 'accord- 
ing to the classical view’ now acquire special significance because, 
in contradistinction to the classical ideas, the Heisenberg principle 





implies a fundamental limitation on the precision with which a 
simultaneous measurement of x and p can be made. 

Our belief in the correctness of the Heisenberg principle rests at 
present on the following foundations: 

(a) When any one of the experiments proposed so far for the 
simultaneous measurement of two canonically conjugate dynami- 
cal variables is studied in the light of the present-day knowledge 
of the properties of the physical agencies employed in the meas- 
urement, it is found that no precise simultaneous values of these 
variables can be inferred from the results of the experiment, and 
that the reliability of whatever inferences can be drawn is in 
accord with the Heisenberg principle. 

(b) The only theory known at present that has a claim to a 
general applicability to atomic phenomena— the quantum theory — 
is in accord with the principle. 

To illustrate the basis of contention (a) by means of an example,
we shall now show that the measurement of the position of a
particle by means of a microscope is accompanied by an alteration
of the momentum of the particle, and that a precise correction
for this alteration cannot be made; and we shall also estimate the
orders of magnitude of the attendant inexactitudes. In doing
this we shall confine ourselves to the x-components of position and
momentum, the x-direction being that from left to right in the
plane of Fig. 26, which shows the direction of the light of wave
length $\lambda$ used to illuminate a particle in the vicinity of the point O;
I and II are the extreme paths that the light scattered by
the particle may take and still get through the microscope.

[Fig. 26: ... a microscope.]

The precision with which a microscope can yield information
regarding the position of a particle depends on the resolving
power determined by the lenses and their arrangement, by the
angle U, and by the wave length $\lambda_1$ of the light scattered by the
particle into the microscope; in particular, if perfect lenses are
arranged in the most satisfactory way, then the smallest interval d
that can be optically resolved is

$$d = \frac{\lambda_1}{2\sin U}. \tag{1}$$





The inexactitude of the information provided by an actual
microscope concerning the x-coordinate of the particle is therefore
at best of the order of magnitude of d:

$$\Delta_x \sim \frac{\lambda_1}{2\sin U}. \tag{2}$$


Let us now suppose that at an instant $t_0$ the particle was actually
observed near the point O, so that its position at that instant is
known with a precision $\Delta_x$. This means that at least one photon
coming from the light source was deflected by the particle into the
microscope. The path taken by this photon after it struck the
particle cannot be determined, because the microscope is
constructed so as to focus the light coming from O regardless of the
direction in which this light may have entered the microscope;
hence, if the photon remained in the plane of Fig. 26, it may have
taken any path between the extremes I and II. Now, as known
from the studies of the Compton effect, the deflection of a photon
by a material particle is accompanied by a change in momentum
of both the photon and the particle; in particular, the change in
the x-component of the momentum of the particle is
$h(\lambda_1 - \lambda\sin U)/\lambda\lambda_1$, or approximately

$$h(1 - \sin U)/\lambda \tag{3}$$

if the photon took path I, and approximately

$$h(1 + \sin U)/\lambda \tag{4}$$

if it took path II; here $\lambda$ is the wave length of the light used for
illumination. Consequently, the fact that an observation of the
position of the particle was made implies that an alteration of its
momentum lying between $h(1 - \sin U)/\lambda$ and $h(1 + \sin U)/\lambda$
was also made. The very construction of the microscope makes it
impossible to determine this alteration more closely, so that our
information regarding the x-component of the momentum of the
particle is inexact by the amount

$$\Delta_{p_x} \sim 2h\lambda^{-1}\sin U. \tag{5}$$

Since $\lambda_1$ differs but little from $\lambda$, (2) and (5) give

$$\Delta_x\Delta_{p_x} \sim h. \tag{6}$$




The argument resulting in (6) is in a sense a hybrid, since it uses 
the wave theory of light in (1) and the photon theory in (3) and 
(4); yet there is abundant evidence for the fact that (1) does give 
the maximum resolving power of a microscope, and that the 
Compton effect does take place when light is scattered by a 
particle. 
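The order-of-magnitude estimate can be made concrete with numbers. The sketch below is an added illustration; it takes the scattered and incident wave lengths to be equal (the approximation used in the text) and uses the SI value of Planck's constant. The function name and the sample values are assumptions.

```python
import math

H = 6.62607015e-34   # Planck's constant in joule-seconds

def microscope_inexactitudes(wavelength, half_angle):
    """Return (Delta_x, Delta_p_x) from the estimates (2) and (5).

    wavelength is in meters and half_angle (the angle U) in radians; the
    scattered wave length is taken equal to the incident one.
    """
    delta_x = wavelength / (2.0 * math.sin(half_angle))
    delta_p = 2.0 * H * math.sin(half_angle) / wavelength
    return delta_x, delta_p

if __name__ == "__main__":
    for wavelength, angle in [(5.0e-7, 0.3), (1.0e-10, 1.0), (1.0e-12, 0.05)]:
        dx, dp = microscope_inexactitudes(wavelength, angle)
        print(wavelength, angle, dx, dp, dx * dp / H)
```

Shortening the wave length or widening the aperture sharpens the position and blurs the momentum in exact proportion, so the last column stays at unity for every choice.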

A microscope, of course, is not the only instrument that can be
used for the measurement of position, but consideration of other
types of apparatus shows again that a measurement of x with a
precision $\Delta_x$ causes the information available regarding $p_x$ to
become inexact in such a way that, after the position has been
observed, $\Delta_x\Delta_{p_x} \geq h/4\pi$; similarly, devices intended primarily
for the measurement of $p_x$ render the knowledge of x inexact.
These matters are discussed very lucidly by Heisenberg in his
book cited in footnote 13.

With regard to contention (b) given above in support of the 
Heisenberg principle, we can but reiterate that the Heisenberg 
inequalities are, as we have seen, deducible from the fundamental 
assumptions of the quantum theory, and that hence the theory does 
not permit the translation into its mathematical terms of just the 
kind of information that, in the light of the Heisenberg principle, 
is to be regarded as not physically obtainable. All deductions 
from the quantum theory are thus bound to be in accord with the 
demands of this principle. 

Exercise 

1. Why cannot a correction for the alteration in $p_x$ of the particle
observed by means of a microscope be made, without affecting the accuracy
of the knowledge of x, by measuring the wave length of the light coming
out of the microscope at the instant $t_0$, and then computing the angle
through which the photon was deflected?

25. Kennard Packets

We shall next determine the form that the Schroedinger function
$\psi$ must have at a given instant of time in order that $\Delta_x\Delta_p$
have at that instant its smallest possible value $h/4\pi$. The
replacement of $\geq$ by = in $13^{23}$ is permissible if and only if it is
permissible in $6^{23}$, that is, if and only if $\psi$ satisfies the equation

$$\psi' + (\alpha x + \beta + i\delta)\psi = 0, \tag{1}$$





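whose general solution is

$$\psi = A\, e^{-\frac{1}{2}\alpha x^2 - \beta x - i\delta x}. \tag{2}$$

If we choose A so as to normalize (2), the resulting $\psi$ may be
written as

$$\psi = \left(\frac{\alpha}{\pi}\right)^{1/4} e^{-\frac{1}{2}\alpha\left(x + \frac{\beta}{\alpha}\right)^2 - i\delta x + i\gamma}, \tag{3}$$

where $\gamma$ is real and independent of x, but otherwise arbitrary.

A computation of av x (= $x_a$), av p (= $p_a$), and $\Delta_x$ for the state
(3) shows that these quantities are related to $\alpha$, $\beta$, and $\delta$ through
Equations $10^{23}$, so that (3) can be written as

$$\psi = \left(2\pi\Delta_x^2\right)^{-1/4} e^{-\frac{(x - x_a)^2}{4\Delta_x^2} + \frac{i p_a x}{\hbar} + i\gamma}. \tag{4}$$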


The distribution-in-x corresponding to the amplitude (4) is the
Gauss distribution

$$P(x) = \frac{1}{\Delta_x\sqrt{2\pi}}\, e^{-\frac{(x - x_a)^2}{2\Delta_x^2}} \tag{5}$$

and has a maximum at $x = x_a$; hence $x_a$ is not only the average of
x but also its most probable value. The distribution-in-p for
the state (4) is (§27)

$$P(p) = \frac{1}{\Delta_p\sqrt{2\pi}}\, e^{-\frac{(p - p_a)^2}{2\Delta_p^2}}, \tag{6}$$

where

$$\Delta_p = \frac{h}{4\pi\Delta_x}; \tag{7}$$

the distribution (6) is also Gaussian, so that $p_a$ is the average
and also the most probable value of p.
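The distribution (6) is quoted here ahead of its derivation in §27. As a hedged illustration, the sketch below assumes the standard rule that the momentum amplitude is the Fourier transform $(2\pi\hbar)^{-1/2}\int \psi\, e^{-ipx/\hbar}\, dx$ of the Schroedinger function, which this section has not yet established, and compares its squared modulus with (6). Units with $\hbar = 1$ and all numerical values are assumptions.

```python
import cmath
import math

HBAR = 1.0   # assumed units

def kennard(x, xa, pa, delta_x, gamma=0.0):
    """Kennard packet (4)."""
    return (2 * math.pi * delta_x ** 2) ** -0.25 * cmath.exp(
        -(x - xa) ** 2 / (4 * delta_x ** 2) + 1j * pa * x / HBAR + 1j * gamma)

def momentum_density(p, xa, pa, delta_x, half_width=12.0, steps=4000):
    """|Fourier amplitude|^2 at momentum p, by a plain Riemann sum centred on xa."""
    lo = xa - half_width * delta_x
    h = 2 * half_width * delta_x / steps
    total = 0j
    for i in range(steps + 1):
        x = lo + i * h
        total += kennard(x, xa, pa, delta_x) * cmath.exp(-1j * p * x / HBAR)
    amplitude = total * h / math.sqrt(2 * math.pi * HBAR)
    return abs(amplitude) ** 2

def gauss_in_p(p, pa, delta_x):
    """Distribution (6) with Delta_p taken from (7), i.e. hbar / (2 Delta_x)."""
    delta_p = HBAR / (2 * delta_x)
    return math.exp(-(p - pa) ** 2 / (2 * delta_p ** 2)) / (delta_p * math.sqrt(2 * math.pi))

if __name__ == "__main__":
    xa, pa, delta_x = 0.3, 2.0, 0.8
    for p in (1.0, 2.0, 2.7):
        print(p, momentum_density(p, xa, pa, delta_x), gauss_in_p(p, pa, delta_x))
```

The two columns should coincide, and the peak at $p = p_a$ shows directly that the phase factor in (4) carries the average momentum.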

The Schroedinger function (4) thus represents a state for which
the most probable value of x is $x_a$, the most probable value of p is
$p_a$, and for which $\Delta_x\Delta_p$ has its least possible value $h/4\pi$. In
specifying the state of a one-dimensional system by the function
(4) we consequently come, so to speak, as near to the specification
of precise initial conditions of the motion (to which we are
accustomed in the classical theory) as it is possible to do in quantum
mechanics. For this reason, states that at some instant of time
have the form (4) are of special interest; we shall call such states




one-dimensional Kennard packets.¹⁴ The packet $1^{21}$, for example,
is a Kennard packet having $x_0$ for the most probable value of x,
zero for the most probable value of p, and $a/\sqrt{2}$ for $\Delta_x$.

The manner in which a Kennard packet varies with the time
depends of course on the dynamical system under consideration,
because the time dependence of a Schroedinger function is
determined by the equation $H\psi = i\hbar\,\partial\psi/\partial t$, whose solutions depend on
the explicit form of the Hamiltonian. In a linear restoring field,
the Kennard packet $1^{21}$ was found to vary with the time as shown
by $19^{21}$; in another field, the behavior of the packet would be
different. The fact that the Hamiltonian must be known in
order that the behavior of a Kennard packet (for that matter,
of any Schroedinger function) may be determined corresponds
to the fact that in classical theory the force field must be known
before the motion of a particle can be calculated.

Quantum theory does not provide for us an opportunity for 
employing in a strict sense the mental picture of a moving particle 
cultivated by daily experience and also by classical mechanics, 
the picture that associates with the particle a definite position 
and momentum at every instant of time and allows us to follow 
mentally the motion of the particle between observations. The
best we can do in seeking a mental image of the motion of a 
particle that is justified by both the quantum and the classical 
theory is to turn to probability packets; but, while in the classical 
theory the probability packets describing x and p of a particle 
may be made more and more narrow without limit (so that in the 
end we get a precise specification of the simultaneous values of
x and p), in quantum theory we must stop with $\Delta_x\Delta_p = h/4\pi$,
that is, with a Kennard packet.

Exercises 

1. Show that $19^{21}$ can be written as

$$\psi = (a\sqrt{\pi})^{-1/2} \exp\left[-\frac{(x - x_0\cos\omega t)^2}{2a^2} - i\left(\frac{x x_0\sin\omega t}{a^2} - \frac{x_0^2\sin 2\omega t}{4a^2} + \frac{\omega t}{2}\right)\right], \tag{8}$$

and find by inspection av x and av p for this state for any time.

¹⁴ E. H. Kennard, Zeits. f. Physik, 44, 326 (1927). Note that for brevity
we use the term packet for both a distribution function and its amplitude.





2. The quantum-mechanical problem that in a sense corresponds most
nearly to the classical problem of determining the motion of a linear
harmonic oscillator if the position and momentum of the particle at t = 0 are
$x_0$ and $p_0$ is that of determining the motion, in a linear restoring field, of a
Kennard packet that at t = 0 represents a state for which the most probable
value of x is $x_0$ and the most probable value of p is $p_0$. In §21 we discussed
the special case $p_0 = 0$ and $\Delta_p^2 = mk\Delta_x^2$.

Consider now the somewhat more general case when $\Delta_p^2 = mk\Delta_x^2$, but
$p_0 \neq 0$. Set up the appropriate Kennard packet for t = 0 to take the place of
the packet $1^{21}$, expand the new $\psi^0$ in terms of the eigenfunctions of the
Hamiltonian of the oscillator, compute approximately the most probable
value of the energy, obtain the time-dependent Schroedinger function and
check it by means of H®, find the most probable values of x and p at the time
t, and calculate the average of the energy. Point out whatever similarities
exist between the quantum-mechanical results and the corresponding
classical results of Exercise 3^®.

3. Recompute the time-dependent Schroedinger function of Exercise 2
by writing $t - \tau$ for t and $x'$ for $x_0$ in (8) and adjusting $x'$ and $\tau$ in the result
in such a way that for t = 0 the x-dependent exponential in it equals the
corresponding exponential in the Kennard packet set up for t = 0 in
Exercise 2. Explain why we may expect this transformation to yield the proper
result, and compare the values of $x'$ and $\tau$ obtained by this process with
those suggested by classical analogy.

26. Summary

We began this chapter by stating in §13 the operator assumption 
I, and then introduced rule II, which requires that quantum- 
mechanical operators be associated with dynamical variables 
according to the quantum condition 1^®. This assumption and 
this rule do not commit us to any specific mathematical form for 
the quantum-mechanical operators. 

In §14 we described the Schroedinger method of associating 
specific operators with dynamical variables, pertaining to one- 
dimensional systems and not involving explicitly the time t, and 
stated that the Schroedinger operands (the significance of which 
was as yet unclarified) are subject to a condition of good behavior. 
We also constructed the first Schroedinger equation 19^^ for the
case of one-dimensional motion. 

The Schroedinger machinery for computing energy levels of 
conservative one-dimensional systems was then complete, and we 
illustrated it in §16 for the case of a linear harmonic oscillator. 

To develop the technique for handling problems other than 
those concerning energy levels, we then introduced rule III®, 
which requires that the time dependence of the Schroedinger 





operands be controlled by the second Schroedinger equation 1^®;
and we showed how the Schroedinger function pertaining to the
instant t can be computed from that pertaining to the instant $t_0$.
Next we introduced the operand rule IV of §17, which implies, in
particular, that a Schroedinger function is associated with a state
of a dynamical system, and conversely. Finally, we introduced
assumption V® (and its one-dimensional form V₁), which provides
the Schroedinger expectation formula 1^^ (and its one-dimensional
form 2^^).

The Schroedinger expectation formula supplies a method for
inferring detailed physical information concerning a dynamical
system in a certain state from the Schroedinger function which
specifies this state; it also supplies (at least in principle) a method
for handling the converse problem, namely, the problem of
identifying the Schroedinger function to be associated with a verbally
described state. Three particular implications of the expectation
formula, pointed out in §17, are: (a) if c is a numerical constant,
then the Schroedinger functions $\psi$ and $c\psi$ specify the same state;
(b) in the one-dimensional case, the distribution-in-x for a state
specified by the Schroedinger function $\psi$ is $\bar\psi\psi$; and (c) if the
Schroedinger function $\psi$ satisfies the equation $\alpha\psi = a\psi$ (where
$\alpha$ is the Schroedinger operator associated with the dynamical
variable $\alpha$, and a is a numerical constant), then the value of the
dynamical variable $\alpha$ for the state specified by this Schroedinger
function is certainly a.

In §18 we began the work of applying the Schroedinger method 
to a number of one-dimensional problems, the work that will be 
continued in the next four chapters. In particular, we derived 
the Heisenberg inequality 13^^ and, in §24, we discussed the 
Heisenberg principle. 

Exercise 

1. Show that, having adopted the expectation formula 2^{17}, we may,
insofar as the one-dimensional Schroedinger method is concerned, omit 
from among our fundamental assumptions the second sentence of the opera- 
tor assumption I. 



Chapter IV


FURTHER PROBLEMS IN THE SCHROEDINGER 
METHOD 


27. Linear Momentum 

The operator associated in the Schroedinger scheme with the
x-component of momentum is

$$-i\hbar\,\frac{\partial}{\partial x}, \tag{1}$$

an operator closely related to 1®. For brevity, we shall speak of
(1) as the 'momentum operator' or simply as the 'momentum,'
rather than as the 'operator associated in the Schroedinger scheme
with the momentum'; the same term will thus be used for the
dynamical variable and for the operator associated with it.
Furthermore, whenever we shall not need to rewrite the operator
(1) in full, we shall denote it by the letter p:

$$p = -i\hbar\,\frac{\partial}{\partial x}, \tag{2}$$

thus using the same symbol for the dynamical variable and for the
operator associated with it.

In Chapter III we considered the momentum of a linear har- 
monic oscillator for various states of this particular dynamical 
system. The form of the operator (1), however, is independent 
of the force field in which the particle may be moving; hence 
certain general properties of the momentum can be investigated 
without specifying any particular dynamical system. Inci- 
dentally, so long as we do not specify the force field, we cannot 
determine explicitly the time dependence of the Schroedinger 
functions, because doing this involves using the second Schroedinger
equation, that is, the equation $H\psi = i\hbar\,\partial\psi/\partial t$; hence our
$\psi$-functions pertaining to an unspecified dynamical system will be
instantaneous $\psi$'s, that is, will refer to some particular instant of
time.

Let us consider first the possible results of a precise measurement
of the momentum. According to the operator assumption of §13,
these are the eigenvalues of (1), so that to calculate them we must
determine those values of the number $\lambda$ for which the equation
$-i\hbar\,\partial\psi/\partial x = \lambda\psi$ has well-behaved solutions. To suggest that
the eigenvalues which we are seeking are the possible values of the
momentum rather than of some other dynamical variable, we
shall use the symbol p' instead of $\lambda$ and shall write the eigenvalue
equation for the momentum as

$$-i\hbar\,\frac{\partial}{\partial x}\psi = p'\psi \tag{3}$$

rather than as $-i\hbar\,\partial\psi/\partial x = \lambda\psi$, with the understanding that the
symbol p' stands for an ordinary number.¹ Equation (3) can be
contracted to $p\psi = p'\psi$, it being understood that p is the operator
(1) and p' is a number. The general solution of (3),

$$\psi = A\,e^{ip'x/\hbar}, \tag{4}$$

where A is independent of x but otherwise arbitrary, is well- 
behaved if and only if p' is a real number, zero included. Con- 
sequently, every real number is an eigenvalue of our operator p, 
and we conclude that, so long as no restriction to any special 
dynamical system in a particular state is made, every real value of 
linear momentum is a possible one, just as in classical mechanics. 
This result is to be expected in the light of some of our previous 
results, such as, for example, the distribution-in-p for the normal 
state of an oscillator. 
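The eigenvalue property just described can be checked numerically. The sketch below is only an illustration: it assumes units in which ħ = 1, replaces the derivative by a central difference, and uses the hypothetical helper names `psi` and `momentum_applied`. It confirms that a function of the form (4) satisfies (3) for a real p', and shows how a complex p' makes the modulus grow without bound.

```python
import cmath

HBAR = 1.0  # assumption: units chosen so that hbar = 1


def psi(x, p_prime, amplitude=1.0):
    """Momentum eigenfunction of the form (4): A * exp(i p' x / hbar)."""
    return amplitude * cmath.exp(1j * p_prime * x / HBAR)


def momentum_applied(f, x, step=1e-5):
    """Approximate (-i hbar d/dx) f at x by a central difference."""
    derivative = (f(x + step) - f(x - step)) / (2.0 * step)
    return -1j * HBAR * derivative


p_prime = 1.7  # illustrative real eigenvalue
x = 0.3
lhs = momentum_applied(lambda s: psi(s, p_prime), x)
rhs = p_prime * psi(x, p_prime)
residual = abs(lhs - rhs)  # small: (3) holds up to discretization error

# With a complex p' the modulus grows without bound as x increases,
# so the solution is not well-behaved.
growth = abs(psi(50.0, 1.0 - 0.5j))
print(residual, growth)
```

The step size and the sample point are arbitrary choices; any real p' gives a comparably small residual.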

The eigenfunction of momentum belonging to the eigenvalue 
p' is (4), so that, in view of the theorem 3 of §17, if the state of a 
system at a given instant is specified by the Schroedinger function 
(4), then a precise momentum measurement made at that instant 
is certain to yield the numerical result p'. Unless p' = 0, (4) 
is a complex function of x, and it is impossible to plot it against x
as abscissa by means of a single line in a plane; an eigenfunction of 
momentum is thus somewhat difficult to visualize. We speak 
of a state described by a Schroedinger function of type (4) as a 
momentum state. 

It follows from (4) that 

$$\bar\psi\psi = |A|^2, \tag{5}$$

so that for a momentum state the distribution-in-x is a constant
independent of x; that is, precise knowledge of p implies complete
ignorance of x. The graph of (5) is exactly like that of Fig. 13;
but, while in obtaining Fig. 13 in the classical theory we de-
liberately disclaimed all knowledge of position, in quantum
mechanics we are compelled to disclaim all knowledge of position
as soon as we claim a precise knowledge of the conjugate momentum.

¹ The symbols $\alpha'$, $\alpha''$, and so on, are often used for the possible values of a
dynamical variable $\alpha$.

The choice of the constant A in (4) is of course arbitrary, 
especially since (4) cannot be normalized to unity. Our discussion 
will, however, be made more concise if we agree to refer to the 
function²

$$\psi = e^{ip'x/\hbar}, \tag{6}$$

obtained from (4) by letting A = 1, as the Schroedinger function 
specifying the state for which the momentum of the particle is 
certainly p', or as the eigenfunction of p belonging to the eigenvalue
p'.

By letting p' in (6) take on every (real) value between $-\infty$ and
$\infty$, we obtain the infinite set of momentum eigenfunctions. This
set is a continuous one, and is to be contrasted with, say, the discrete
set of the eigenfunctions of H of the oscillator listed in 7^®.

We shall now proceed to show that a large class of functions $\psi$
can be 'expanded' in terms of the momentum eigenfunctions in
the form

$$\psi = \int_{-\infty}^{\infty} c_{p'}\,e^{ip'x/\hbar}\,dp', \tag{7}$$

and that the distribution-in-p for the state $\psi$ can be inferred from
the expansion coefficients, that is, the c's in (7).

Fourier integrals. To pave the way for expansions of the form
(7), we return to the set

$$L^{-1/2}\,e^{2\pi i n x/L}, \qquad n = \cdots, -2, -1, 0, 1, 2, \cdots, \tag{8}$$

which is orthonormal in any interval of length L. If f(x) can be
expanded in terms of this set in the interval $(-\tfrac12 L, \tfrac12 L)$, so that,
in this interval,

$$f(x) = \sum_n c_n\,L^{-1/2}\,e^{2\pi i n x/L}, \tag{9}$$

then, in view of 21®, the expansion coefficients are

$$c_n = \int_{-L/2}^{L/2} L^{-1/2}\,e^{-2\pi i n x/L} f(x)\,dx. \tag{10}$$

² Since (6) is an amplitude of a relative distribution, its physical dimensions
are not of importance.

Using s instead of x for the variable of integration in (10), we put
(9) into the form

$$f(x) = L^{-1}\sum_n\left[\int_{-L/2}^{L/2} f(s)\,e^{-2\pi i n s/L}\,ds\right]e^{2\pi i n x/L}. \tag{11}$$

Since the right side of (11) is periodic in x with period L, the
equality (11) holds outside the interval $(-\tfrac12 L, \tfrac12 L)$ only if f(x) is
also periodic in x with period L: a nonperiodic function cannot
be represented by a Fourier series in the infinite interval.
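The periodicity argument can be made concrete with a small numerical sketch. It is an illustration under stated assumptions: the test function exp(-8x²), the interval length L = 2, the truncation to |n| ≤ 20, and the midpoint-rule helper `fourier_coefficient` are all choices made here, not taken from the text. The partial sum reproduces the function inside the interval, but outside it simply repeats with period L.

```python
import cmath
import math


def fourier_coefficient(f, n, L, samples=2000):
    """c_n: integral over (-L/2, L/2) of L^(-1/2) exp(-2 pi i n x / L) f(x) dx,
    approximated with the midpoint rule."""
    dx = L / samples
    total = 0j
    for j in range(samples):
        x = -L / 2 + (j + 0.5) * dx
        total += cmath.exp(-2j * math.pi * n * x / L) * f(x) * dx
    return total / math.sqrt(L)


def partial_sum(coeffs, x, L):
    """Truncated Fourier series: sum of c_n L^(-1/2) exp(2 pi i n x / L)."""
    total = sum(c * cmath.exp(2j * math.pi * n * x / L) for n, c in coeffs.items())
    return total / math.sqrt(L)


L = 2.0


def f(x):
    """Nonperiodic test function (an arbitrary illustrative choice)."""
    return math.exp(-8.0 * x * x)


coeffs = {n: fourier_coefficient(f, n, L) for n in range(-20, 21)}

inside = partial_sum(coeffs, 0.25, L).real       # close to f(0.25)
outside = partial_sum(coeffs, 0.25 + L, L).real  # repeats the inside value
print(inside, f(0.25), outside, f(0.25 + L))
```

Outside the interval the series returns the value at x = 0.25 again, while the function itself has practically vanished there.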

We shall now indicate, by allowing L in (11) to grow large
without limit, how a nonperiodic function can be expressed in the
infinite interval in a form resembling (11). Let u = n/L; then,
as n runs through its successive integral values, u increases at
each step by the amount $\Delta u = 1/L$, so that (11) can be written as

$$f(x) = \sum_u\left[\int_{-L/2}^{L/2} f(s)\,e^{-2\pi i u s}\,ds\right]e^{2\pi i u x}\,\Delta u. \tag{12}$$

If L in (12) is allowed to increase without limit, we obtain the
improper integral with the limits $-\infty$ and $\infty$; further, an integra-
tion with respect to u can be substituted for the summation, and
we get

$$f(x) = \int_{-\infty}^{\infty}\left[\int_{-\infty}^{\infty} f(s)\,e^{-2\pi i u s}\,ds\right]e^{2\pi i u x}\,du, \tag{13}$$

that is,

$$f(x) = \int_{-\infty}^{\infty} c_u\,e^{2\pi i u x}\,du, \tag{14}$$

where x ranges over the entire infinite interval $(-\infty, \infty)$, and
where

$$c_u = \int_{-\infty}^{\infty} e^{-2\pi i u x} f(x)\,dx. \tag{15}$$
In view of (6), Equation (14) is of the same form as (7), so that
we are getting nearer to the solution of the problem of resolving a
state in terms of momentum states. Equation (13) is one form
of the Fourier integral theorem.³
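A numerical sketch may help in seeing the Fourier integral theorem at work. The example below is hedged: the Gaussian exp(-πx²), the finite integration limits, and the midpoint-rule helper `integrate` are assumptions made for illustration. For this particular function the coefficient c_u is known to be exp(-πu²), and integrating c_u against exp(2πiux) should return the original function.

```python
import cmath
import math


def integrate(g, lo, hi, samples):
    """Midpoint-rule approximation to the integral of g from lo to hi."""
    dx = (hi - lo) / samples
    return sum(g(lo + (j + 0.5) * dx) for j in range(samples)) * dx


def f(x):
    """Test function (an illustrative choice): exp(-pi x^2)."""
    return math.exp(-math.pi * x * x)


def c(u):
    """Coefficient c_u: integral of exp(-2 pi i u x) f(x) dx,
    with the infinite limits truncated to (-6, 6)."""
    return integrate(lambda x: cmath.exp(-2j * math.pi * u * x) * f(x), -6.0, 6.0, 800)


def f_rebuilt(x):
    """Inverse step: integral of c_u exp(2 pi i u x) du, truncated to (-4, 4)."""
    return integrate(lambda u: c(u) * cmath.exp(2j * math.pi * u * x), -4.0, 4.0, 200)


coefficient = c(0.7)        # compare with exp(-pi * 0.7^2)
rebuilt = f_rebuilt(0.5)    # compare with f(0.5)
print(coefficient, math.exp(-math.pi * 0.49), rebuilt, f(0.5))
```

The truncation of the limits is harmless here only because the Gaussian and its transform both fall off extremely fast.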

The quantity $c_u$ given by (15) is a function of u; its value for
any particular value of u is the weight, so to speak, with which the
corresponding function $e^{2\pi i u x}$ enters into the expansion of f(x) in
terms of functions of this form. The role of $c_u$ in the expansion (14) is
analogous to that of the whole list of the coefficients
$\cdots, c_{-2}, c_{-1}, c_0, c_1, c_2, \cdots$ in the expansion (9).

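Let f = f(x) and g = g(x) be any two functions that can be expanded into
Fourier series in the interval $(-\tfrac12 L, \tfrac12 L)$:

$$f = \sum_n a_n\,L^{-1/2}\,e^{2\pi i n x/L}, \qquad n = \cdots, -2, -1, 0, 1, \cdots \tag{16}$$

$$g = \sum_m b_m\,L^{-1/2}\,e^{2\pi i m x/L}, \qquad m = \cdots, -2, -1, 0, 1, \cdots \tag{17}$$

and let the Fourier series for the function f(x)g(x) be

$$fg = \sum_k c_k\,L^{-1/2}\,e^{2\pi i k x/L}, \qquad k = \cdots, -2, -1, 0, 1, \cdots. \tag{18}$$

The a's, b's, and c's can all be obtained by means of 21®; in particular,

$$c_0 = L^{-1/2}\int_{-L/2}^{L/2} fg\,dx, \tag{19}$$

so that the term in (18) that does not depend on x is

$$L^{-1}\int_{-L/2}^{L/2} fg\,dx. \tag{20}$$

By direct multiplication of (16) and (17) we get

$$fg = L^{-1}\sum_n\sum_m a_n b_m\,e^{2\pi i (n+m) x/L}, \tag{21}$$

so that the constant term in (18) can also be found by summing the right
side of (21) under the condition n + m = 0, the result being

$$L^{-1}\sum_n a_{-n} b_n. \tag{22}$$

Equating (20) to (22), and using 21® to eliminate $a_{-n}$ and $b_n$, we get

$$\int_{-L/2}^{L/2} fg\,dx = L^{-1}\sum_n\left[\int_{-L/2}^{L/2} f(x)\,e^{2\pi i n x/L}\,dx\right]\left[\int_{-L/2}^{L/2} g(s)\,e^{-2\pi i n s/L}\,ds\right]. \tag{23}$$

If we now write u = n/L and allow L to become infinite, (23) goes over into

$$\int f(x)g(x)\,dx = \int du\left[\int f(x)\,e^{2\pi i u x}\,dx\right]\left[\int g(s)\,e^{-2\pi i u s}\,ds\right]. \tag{24}$$

³ For the conditions that must be satisfied by f(x) in order that (13) may
hold rigorously, see a mathematical discussion of Fourier integrals.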



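Now, let g in (24) be the kth derivative of some function (which we shall
denote by g in order that our final result have a convenient form), and let
the new g and its first k − 1 derivatives vanish when $|x| \to \infty$. Equation
(24) then becomes

$$\int f\,\frac{d^k g}{dx^k}\,dx = \int du\left[\int f(x)\,e^{2\pi i u x}\,dx\right]\left[\int \frac{d^k g}{ds^k}\,e^{-2\pi i u s}\,ds\right], \tag{25}$$

and the second bracketed factor in (25), when integrated by parts k times,
becomes $(2\pi i u)^k\int g(s)\,e^{-2\pi i u s}\,ds$. The final result, namely,

$$\int f(x)\,\frac{d^k}{dx^k}\,g(x)\,dx = \int (2\pi i u)^k\,du\left[\int f(x)\,e^{2\pi i u x}\,dx\right]\left[\int g(s)\,e^{-2\pi i u s}\,ds\right], \tag{26}$$

will find immediate application in our work.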

Amplitudes of distributions-in-p. If a normalized Schroedinger
function $\psi$ describing the state of an oscillator is expanded in
terms of the normalized eigenfunctions of H in the form

$$\psi = \sum_n c_n\psi_n, \tag{27}$$

then, as shown in §20, the probability that the energy for the
state $\psi$ be $E_n$ is $|c_n|^2$, so that $c_n$ is an amplitude of the probability
that the energy for the state $\psi$ be $E_n$. We may, by analogy,
expect that if a Schroedinger function $\psi$ is expanded in terms of
the eigenfunctions of p in the form

$$\text{(7)}\qquad \psi = \int_{-\infty}^{\infty} c_p\,e^{ipx/\hbar}\,dp, \tag{28}$$

where for simplicity p is written for p', then $c_p$, a function of p,
should be proportional to the amplitude of the probability that for
the state $\psi$ the momentum lie in dp at p. This surmise is a correct
one, as we shall now show. Since p ranges from $-\infty$ to $\infty$, the
symbol $\int$ in integrals with respect to p will imply that the limits
of integration are $-\infty$ and $\infty$.

Let the state of a one-dimensional system be specified by a
Schroedinger function $\psi$ which, together with all of its derivatives,
vanishes when $|x| \to \infty$ and which can be expressed as a Fourier
integral:

$$\psi = \int c_u\,e^{2\pi i u x}\,du, \tag{29}$$

where, according to (15), $c_u = \int \psi\,e^{-2\pi i u x}\,dx$. Letting $u = p/h =
p/2\pi\hbar$ in (29), we get

$$\psi = \int h^{-1} c_p\,e^{ipx/\hbar}\,dp, \tag{30}$$

where

$$c_p = \int \psi\,e^{-ipx/\hbar}\,dx, \tag{31}$$

so that $\psi$ has been expanded in terms of the momentum states.
We expect then that the amplitude [which we denote by $A_\psi(p)$]
of the distribution-in-p for the state $\psi$ is proportional to (31) and,
choosing the constant of proportionality to be $h^{-1/2}$, we write
tentatively

$$A_\psi(p) = h^{-1/2}\int \psi\,e^{-ipx/\hbar}\,dx. \tag{32}$$


Having arrived at a plausible expression for $A_\psi(p)$, we proceed
to prove that it is consistent with our fundamental assumptions.
According to the expectation formula 2^{17}, the average of $p^k$ for
the state specified by the Schroedinger function $\psi$ is

$$\frac{\int \bar\psi\left(-i\hbar\,\dfrac{\partial}{\partial x}\right)^{k}\psi\,dx}{\int \bar\psi\psi\,dx}. \tag{33}$$

Now, if the amplitude of the distribution-in-p is $A_\psi(p)$, then the
distribution-in-p is $\bar A_\psi(p)A_\psi(p)$, so that, if $A_\psi(p)$ is available, then
the average of $p^k$ for the state $\psi$, calculated by means of 27“, is

$$\frac{\int \bar A_\psi(p)\,p^k A_\psi(p)\,dp}{\int \bar A_\psi(p)A_\psi(p)\,dp}; \tag{34}$$

the factors in the integrands of (34) are arranged in our customary
order, although in the present case the order does not matter. To
establish (32), we must show that, if $A_\psi(p)$ is given by (32), the
numerical value of (34) is, for every positive integral value of k,
the same as the numerical value of (33).

We write $f = \bar\psi$, $g = \psi$, and $u = p/h$ in (26), multiply both sides
by $(-i\hbar)^k$, and get⁴

$$\int \bar\psi\left(-i\hbar\,\frac{\partial}{\partial x}\right)^{k}\psi\,dx = \int \bar A_\psi(p)\,p^k A_\psi(p)\,dp, \tag{35}$$

so that for $A_\psi(p)$ given by (32) the numerators of (33) and (34)
are equal; the case k = 0 in (35) shows that the denominators of
(33) and (34) are also equal. Consequently (32) yields the correct
values of av $p^k$, and its validity is established: the amplitude of
the distribution-in-p for a state of a one-dimensional system described
by a Schroedinger function $\psi$ is given by (32).

⁴ It is of course permissible to write $\partial/\partial x$ for $d/dx$ in (26).
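The content of the transformation formula can be illustrated with a short numerical sketch. Everything specific in it is an assumption made for the example: units with ħ = 1 (so that h = 2π), a Gaussian packet with mean momentum `P0`, finite integration limits, and the helper names `integrate` and `amplitude`. The sketch evaluates the amplitude of the distribution-in-p by direct integration and checks that the distribution-in-p has the same total weight as the distribution-in-x and is centered on `P0`.

```python
import cmath
import math

HBAR = 1.0                 # assumption: units with hbar = 1
H = 2.0 * math.pi * HBAR   # Planck's constant in the same units
P0 = 1.5                   # mean momentum of the illustrative packet


def integrate(g, lo, hi, samples):
    """Midpoint-rule approximation to the integral of g from lo to hi."""
    dx = (hi - lo) / samples
    return sum(g(lo + (j + 0.5) * dx) for j in range(samples)) * dx


def psi(x):
    """Illustrative Schroedinger function: a Gaussian packet (not normalized)."""
    return cmath.exp(-x * x / 2.0 + 1j * P0 * x / HBAR)


def amplitude(p):
    """Amplitude of the distribution-in-p: h^(-1/2) * integral of psi exp(-i p x / hbar) dx."""
    return integrate(lambda x: psi(x) * cmath.exp(-1j * p * x / HBAR), -10.0, 10.0, 400) / math.sqrt(H)


norm_x = integrate(lambda x: abs(psi(x)) ** 2, -10.0, 10.0, 400)
norm_p = integrate(lambda p: abs(amplitude(p)) ** 2, P0 - 8.0, P0 + 8.0, 160)
mean_p = integrate(lambda p: p * abs(amplitude(p)) ** 2, P0 - 8.0, P0 + 8.0, 160) / norm_p
print(norm_x, norm_p, mean_p)
```

The agreement of the two norms is the k = 0 case of the argument above; the mean momentum is the k = 1 case for this particular packet.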

The formula (32) enables us to transform, so to speak, the
amplitude of the distribution-in-x for a given state into the ampli-
tude of the distribution-in-p for the same state; we therefore call
it a transformation formula.

Note that multiplication of the right side of (32) by a factor
of the form $Ce^{i\gamma}$, where C is a constant and $\gamma$ is a real function of
p and t, does not affect the value of (34), so that we have not
shown that (32) is the only suitable expression for $A_\psi(p)$. But a
closer study, which we shall not undertake here, shows that the
multiplication of the right side of (32) by a function of p or t is not
permissible, and that consequently our result is indeterminate
to the extent of only a trivial constant factor. The factor $h^{-1/2}$ in
(32) is convenient because, as was shown above, it makes the
numerators of (33) and (34) precisely equal to each other, so that,
in particular, if $\bar\psi\psi$ is an absolute distribution-in-x, then $\bar A_\psi(p)A_\psi(p)$
is an absolute distribution-in-p.

Examples. To illustrate the applications of the transformation 
formula (32), we shall now calculate the distribution-in-p for the 
normal state of an oscillator. This distribution was found in §18
by a method that involved a comparison of the averages of the 
powers of p with those of x, a method successful only because of 
the symmetry of the Hamiltonian of the oscillator in x and p; 
the present method, on the other hand, will not involve the 
computation of any averages. 

According to (32), the amplitude, which we denote by $A_0(p)$,
of the distribution-in-p for the normal state of the oscillator is

$$A_0(p) = h^{-1/2}\int \psi_0\,e^{-ipx/\hbar}\,dx, \tag{36}$$

where $\psi_0$ is the Schroedinger function for the normal state, given in
the normalized form by 12“; that is,

$$A_0(p) = \frac{1}{\sqrt{ah\sqrt{\pi}}}\int_{-\infty}^{\infty} e^{-x^2/2a^2 - ipx/\hbar}\,dx. \tag{37}$$

Integrating with the help of the formula 23“ we get

$$A_0(p) = \frac{1}{\sqrt{b\sqrt{\pi}}}\,e^{-p^2/2b^2}, \tag{38}$$

where b is the constant $\sqrt[4]{km\hbar^2} = \hbar/a$, first introduced in 25 .
The distribution-in-p for the normal state is thus the product of
(38) and its complex conjugate, and we have

$$P_0(p) = \bar A_0(p)A_0(p) = \frac{1}{b\sqrt{\pi}}\,e^{-p^2/b^2}, \tag{39}$$

in agreement with 33“. The present calculation is of course much
more straightforward than that of §18.
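The closed form for the normal state can be compared with a direct numerical evaluation of the integral. This is a minimal sketch under assumptions: ħ = 1, an arbitrary value of the length constant a, integration limits cut off at ±12a, and hypothetical function names. It evaluates the integral form of A_0(p) at a few momenta and compares each value with the Gaussian closed form in which b = ħ/a.

```python
import cmath
import math

HBAR = 1.0                 # assumption: units with hbar = 1
H = 2.0 * math.pi * HBAR
A_LENGTH = 1.3             # illustrative value of the oscillator constant a
B_MOMENTUM = HBAR / A_LENGTH


def amplitude_numeric(p, samples=600):
    """Integral form: (a h sqrt(pi))^(-1/2) * integral of exp(-x^2/2a^2 - i p x / hbar) dx."""
    lo, hi = -12.0 * A_LENGTH, 12.0 * A_LENGTH
    dx = (hi - lo) / samples
    total = 0j
    for j in range(samples):
        x = lo + (j + 0.5) * dx
        total += cmath.exp(-x * x / (2.0 * A_LENGTH ** 2) - 1j * p * x / HBAR) * dx
    return total / math.sqrt(A_LENGTH * H * math.sqrt(math.pi))


def amplitude_closed(p):
    """Closed form: (b sqrt(pi))^(-1/2) * exp(-p^2 / 2 b^2)."""
    return math.exp(-p * p / (2.0 * B_MOMENTUM ** 2)) / math.sqrt(B_MOMENTUM * math.sqrt(math.pi))


momenta = [0.0, 0.4, 1.0, 2.0]
max_error = max(abs(amplitude_numeric(p) - amplitude_closed(p)) for p in momenta)
print(max_error)
```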

The amplitude of the absolute distribution-in-p for the nth
energy state of the oscillator is, according to (32),

$$A_n(p) = h^{-1/2}\int \psi_n\,e^{-ipx/\hbar}\,dx, \tag{40}$$

where $\psi_n$ is 7“; integration yields⁵

$$A_n(p) = \frac{(-i)^n}{\sqrt{b\,2^n n!\,\sqrt{\pi}}}\;e^{-\frac12 (p/b)^2}\,H_n(p/b)\,e^{-iE_n t/\hbar}. \tag{41}$$

The absolute distribution-in-p corresponding to (41) is just 38^®.

Further examples. The validity of (32) was proved above
only for severely restricted $\psi$-functions, and the question as to
whether or not this transformation formula holds when some of the
restrictions on $\psi$ are removed has been left open. We shall,
however, use (32) even if $\psi$ is not of integrable square, although
we shall then have occasionally to use mathematical methods
formally rather than rigorously.

To illustrate, let us see what kind of a result is given by (32) for
the state specified by the Schroedinger function (6), that is, for a
state for which the momentum is certainly p'. We have

$$A(p) = h^{-1/2}\int e^{ip'x/\hbar}\,e^{-ipx/\hbar}\,dx = h^{-1/2}\int e^{i(p'-p)x/\hbar}\,dx. \tag{42}$$

The integral is indeterminate for $p \ne p'$ and is infinite for p = p';
but it will be shown in Exercise 4 that setting $\int e^{i(p'-p)x/\hbar}\,dx$ equal to
$h\,\delta(p' - p)$ is consistent with the definition of the $\delta$-function.
Since $\delta(p' - p) = \delta(p - p')$, we then get from (42)

$$A(p) = h^{1/2}\,\delta(p - p'). \tag{43}$$

The distribution-in-p for our momentum state is thus $h\,\delta(p - p')\,\delta(p - p')$,
which we denote by $h\,\delta^2(p - p')$; it vanishes for every
value of p except p', for which it is infinite. This type of result
is of course to be expected, since we are trying to represent a
discrete distribution by a continuous distribution function. At
the end of §12 we concluded that, if the value of a quantity is a
certainty, then the absolute distribution in this quantity can be
taken to be a $\delta$-function; in the present calculation, the amplitude,
rather than the distribution itself, is a $\delta$-function, so that the
resulting distribution must be interpreted as a relative one.

⁵ See, for example, G. A. Campbell and R. M. Foster, Fourier Integrals
for Practical Applications, page 14. New York: Bell Telephone System,

As a further illustration of the use of (32) when $\psi$ is not quadra-
tically integrable, let us consider the case when the state of a
system is specified at a given instant by the Schroedinger function

$$\psi = c_1 e^{ip'x/\hbar} + c_2 e^{ip''x/\hbar}, \tag{44}$$

where the numbers p' and p'' are not equal to each other and
where the c's are known constants, and let us calculate the dis-
tribution-in-p for that instant. Since $p' \ne p''$, our $\psi$ is not an
eigenfunction of the momentum operator, and the structure of
(44) suggests that the possible values of p are p' and p'', that their
respective relative probabilities are $|c_1|^2$ and $|c_2|^2$, and that
hence their respective absolute probabilities are $|c_1|^2/(|c_1|^2 + |c_2|^2)$
and $|c_2|^2/(|c_1|^2 + |c_2|^2)$.

Substituting (44) into (32) and treating the resulting integrals
in the same way as we did (42), we get

$$A_\psi(p) = h^{-1/2}\int\left(c_1 e^{ip'x/\hbar} + c_2 e^{ip''x/\hbar}\right)e^{-ipx/\hbar}\,dx = c_1 h^{1/2}\,\delta(p - p') + c_2 h^{1/2}\,\delta(p - p''). \tag{45}$$

The distribution-in-p is the product of (45) and its complex
conjugate, and since $\delta(p - p')\,\delta(p - p'') = 0$ when $p' \ne p''$,
we have

$$P_\psi(p) = |c_1|^2\,h\,\delta^2(p - p') + |c_2|^2\,h\,\delta^2(p - p''). \tag{46}$$

This result implies just what we expected in the first place: at the
instant at which the Schroedinger function is (44), the possible
values of p are p' and p'', and their relative probabilities are
$|c_1|^2$ and $|c_2|^2$.
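The passage from relative to absolute probabilities is simple arithmetic, sketched below. The function name `absolute_probabilities` and the sample coefficients are hypothetical choices made for illustration; the phases of the coefficients drop out because only the squared moduli enter.

```python
def absolute_probabilities(c1, c2):
    """Turn the relative weights |c1|^2 and |c2|^2 of the two momentum values
    into absolute probabilities that add up to one."""
    w1 = abs(c1) ** 2
    w2 = abs(c2) ** 2
    total = w1 + w2
    return w1 / total, w2 / total


# Illustrative coefficients; the factor i on the second one does not matter here.
prob_first, prob_second = absolute_probabilities(3.0, 4.0j)
print(prob_first, prob_second)
```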



Letting $c_1 = |c_1|e^{i\gamma_1}$ and $c_2 = |c_2|e^{i\gamma_2}$, we get the following
expression for the distribution-in-x for a state for which the Schroe-
dinger function is (44):

$$\bar\psi\psi = |c_1|^2 + |c_2|^2 + 2|c_1||c_2|\cos\left[\frac{(p' - p'')x}{\hbar} + \gamma_1 - \gamma_2\right]; \tag{47}$$

its form is shown in Fig. 27. The distribution-in-x is periodic in
x (period $h/|p' - p''|$), and not all values of x are equally likely.

Fig. 27. Distribution-in-x for a state of the form (44).
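Since the figure itself is not reproduced here, a numerical sketch of the distribution-in-x for a superposition of two momentum states may be useful. It is an illustration under assumptions: ħ = 1, arbitrary sample values for the coefficients and momenta, and hypothetical helper names. It checks that the squared modulus of the superposition equals the sum of the two weights plus a cosine interference term, and that the pattern repeats with period h/|p' − p''|.

```python
import cmath
import math

HBAR = 1.0                 # assumption: units with hbar = 1
H = 2.0 * math.pi * HBAR


def psi(x, c1, c2, p1, p2):
    """Superposition of two momentum states with momenta p1 and p2."""
    return c1 * cmath.exp(1j * p1 * x / HBAR) + c2 * cmath.exp(1j * p2 * x / HBAR)


def density_formula(x, c1, c2, p1, p2):
    """|c1|^2 + |c2|^2 + 2|c1||c2| cos[(p1 - p2) x / hbar + phase(c1) - phase(c2)]."""
    angle = (p1 - p2) * x / HBAR + cmath.phase(c1) - cmath.phase(c2)
    return abs(c1) ** 2 + abs(c2) ** 2 + 2.0 * abs(c1) * abs(c2) * math.cos(angle)


c1, c2 = 0.6, 0.8j         # illustrative coefficients
p1, p2 = 2.0, -1.0         # illustrative momenta
period = H / abs(p1 - p2)

x = 0.37
direct = abs(psi(x, c1, c2, p1, p2)) ** 2
shifted = abs(psi(x + period, c1, c2, p1, p2)) ** 2
print(direct, density_formula(x, c1, c2, p1, p2), shifted)
```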

A Schroedinger function of type (44) is $\psi_c = \cos kx$. We have

$$\psi_c = \cos kx = \tfrac12 e^{ikx} + \tfrac12 e^{-ikx}, \tag{48}$$

so that the possible values of p are $k\hbar$ and $-k\hbar$, and these values are equally
probable; the expected average of the momentum is therefore zero. An-
other function of the same type is

$$\psi_s = \sin kx = \frac{1}{2i}\,e^{ikx} - \frac{1}{2i}\,e^{-ikx}; \tag{49}$$

the distribution-in-p is the same as for (48), but the two distributions-in-x
are different.

The Schroedinger functions (48) and (49) may serve to illustrate a curious
feature of coherent quantum-mechanical superposition. The $\psi$-function
(48) implies that the momentum is not certain; the $\psi$-function (49) also
implies that the momentum is not certain; yet the $\psi$-function

$$\psi = \psi_c + i\psi_s = e^{ikx}, \tag{50}$$

that is, a special coherent superposition of (48) and (49), implies that the
momentum is a certainty! Examples of this kind show that, although the
discussion of systems whose preparation involves the tossing of a coin
(for example, a discussion along the lines of §19) may be helpful in giving a
beginner a start toward understanding quantum-mechanical superposition,
such discussion loses its effectiveness when the superposition is coherent,
because neither the classical theory of probability nor the classical theory
of particle motion can serve as a starting point for a discussion of the
physical meaning of the phases of probability amplitudes. To be sure, an
analogy to the quantum-mechanical superposition of states can, as we shall
presently see, be found in the classical theory of wave motion; but this
analogy is one of mathematical method rather than of physical content.
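The point about coherent superposition can be seen numerically. In the sketch below, which assumes ħ = 1, an arbitrary wave number `K`, and a central-difference derivative, the quantity (pψ)/ψ is evaluated at several points. For cos kx alone it changes from point to point, so the function is not a momentum eigenfunction; for cos kx + i sin kx it equals kħ everywhere.

```python
import math

HBAR = 1.0   # assumption: units with hbar = 1
K = 2.0      # illustrative wave number k


def psi_c(x):
    return math.cos(K * x)


def psi_s(x):
    return math.sin(K * x)


def psi_sum(x):
    """Coherent superposition psi_c + i psi_s, which equals exp(i k x)."""
    return psi_c(x) + 1j * psi_s(x)


def p_over_psi(f, x, step=1e-5):
    """(-i hbar df/dx) / f at x, with a central-difference derivative."""
    derivative = (f(x + step) - f(x - step)) / (2.0 * step)
    return -1j * HBAR * derivative / f(x)


points = (0.1, 0.4, 0.9)
ratios_sum = [p_over_psi(psi_sum, x) for x in points]   # all close to k * hbar
ratios_cos = [p_over_psi(psi_c, x) for x in points]     # vary with x
print(ratios_sum, ratios_cos)
```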

Notation. In dealing with one-dimensional systems, we have
used the symbol $\psi$ for a Schroedinger function, with the under-
standing that $\psi$ is a function of x and t, interpreted as the amplitude
of a distribution-in-x; but when we consider concurrently ampli-
tudes of the distributions of more than one variable, say of x and
of p, as we have been doing in this section, the following notation,
due to Dirac, is usually more convenient.

Instead of denoting a Schroedinger function by $\psi$, we denote it
by (x | ), the symbol (x | ) being read "x, line." Instead of
denoting the complex conjugate of the Schroedinger function by
$\bar\psi$, we denote it by ( | x), so that ( | x) = $\overline{(x\,|\ )}$; the symbol ( | x) is
read "line, x." The corresponding distribution-in-x is then
denoted by ( | x)(x | ) instead of $\bar\psi\psi$.

The spaces to the right and to the left, respectively, of the
vertical lines in (x | ) and ( | x) are reserved for whatever symbols
may be required to particularize a state. For example, we write
(x | k) for $\psi_k$, the symbol (x | k) being read "x, line, k"; $\bar\psi_k$ is
then denoted by (k | x), so that (k | x) = $\overline{(x\,|\,k)}$, and $\bar\psi_k\psi_k$ is
written as (k | x)(x | k).

To illustrate, we rewrite in the new notation some expressions
taken from the preceding pages. The Schroedinger equation for
the linear harmonic oscillator:

$$-\frac{\hbar^2}{2m}\frac{d^2}{dx^2}\,(x\,|\ ) + \tfrac12 kx^2\,(x\,|\ ) = E\,(x\,|\ ). \tag{51}$$

The normalized Schroedinger eigenfunctions of the Hamiltonian
of the oscillator:

$$(x\,|\,n) = \psi_n. \tag{52}$$

The eigenfunction of the momentum belonging to the eigenvalue
p' of p:

$$\text{(6)}\qquad (x\,|\,p') = e^{ip'x/\hbar}. \tag{53}$$

The complex conjugate of (53):

$$(p'\,|\,x) = e^{-ip'x/\hbar}. \tag{54}$$

The distribution-in-x for the normal state of the oscillator:

$$(0\,|\,x)(x\,|\,0) = \frac{1}{a\sqrt{\pi}}\,e^{-x^2/a^2}. \tag{55}$$

Amplitudes of distributions in dynamical variables other than x
are denoted in an analogous manner. For example, when the amplitude
of the distribution-in-x for a state is denoted by (x | ), the amplitude
of the distribution-in-p for the same state is denoted by (p | ), and
the distribution-in-p is denoted by ( | p)(p | ); and if the
amplitude of the distribution-in-x for a state is (x | k),
then the amplitude of the distribution-in-p, the complex conjugate
of this amplitude, and the distribution-in-p itself are
denoted, respectively, by (p | k), (k | p), and (k | p)(p | k).
Thus we denote the amplitude of the distribution-in-p for the
normal state of the oscillator by (p | 0):

$$\text{(38)}\qquad (p\,|\,0) = \frac{1}{\sqrt{b\sqrt{\pi}}}\,e^{-p^2/2b^2}, \tag{56}$$

and the distribution-in-p for the normal state by (0 | p)(p | 0):

$$\text{(39)}\qquad (0\,|\,p)(p\,|\,0) = \frac{1}{b\sqrt{\pi}}\,e^{-p^2/b^2}. \tag{57}$$

The transformation formula (32), the main result of this section,
now takes the form

$$\text{(32)}\qquad (p\,|\ ) = h^{-1/2}\int (x\,|\ )\,e^{-ipx/\hbar}\,dx. \tag{58}$$

Exercises

.f r, .J.d * w ,. e 



2. Show that, if

$$g(u) = \int_{-\infty}^{\infty} f(s)\,e^{-2\pi i u s}\,ds, \tag{59}$$

then

$$f(s) = \int_{-\infty}^{\infty} g(u)\,e^{2\pi i u s}\,du, \tag{60}$$

and conversely. The process of deducing one of the relations (59) and (60)
from the other is called inversion of a Fourier integral. Two functions,
f(s) and g(u), related to each other through (59) and (60) are called Fourier
transforms of one another.

3. Derive the following theorems from (59) and (60): (a) if one of the two
functions f and g is identically zero, then the other is also identically zero;
(b) if one of the two functions f and g is even,⁶ then the other is also even,
while if one of them is odd, then the other is also odd; and (c) if both f and
g are real, then both are even.

4. Recall 26^{12} and verify that $e^{2\pi i u_0 x} = \int c_u\,e^{2\pi i u x}\,du$, with $c_u = \delta(u - u_0)$;
this formula means that, when $e^{2\pi i u_0 x}$ is expanded in terms of the continuous
set of functions $e^{2\pi i u x}$, then the coefficients of the functions for which $u \ne u_0$
are all zero, while the coefficient of the function $e^{2\pi i u_0 x}$ is so great that the
integral is finite even though the integrand vanishes everywhere except at
the single point $u = u_0$. Then show by inversion that

$$\int_{-\infty}^{\infty} e^{-2\pi i (u - u_0)x}\,dx = \delta(u - u_0). \tag{61}$$

Try to evaluate the integral in (61) directly, and discuss the rigor of (61).

5. If we try to compute av $p^k$ for the state (44) by means of the expecta-
tion formula 2^{17}, we get meaningless integrals. But show that, if the
integrals in 2^{17} are evaluated between the limits $-l$ and $l$ (rather than
$-\infty$ and $\infty$), and av $p^k$ is taken to be the limit of the quotient of the two
integrals as $l \to \infty$, then the result is in agreement with that implied in (46).

6. Determine the absolute distributions-in-p, and plot rough graphs of
the distributions-in-x implied by the following Schroedinger functions:

(12 -h H- (16 — 12 f)c^***; ^ sin* kx; - cos® kx. 

7. The only information available concerning a particle is that at a 
certain instant its momentum is either p' or p'', the probability of the 
first alternative being 0.36 and that of the second being 0.64. Write down 
the Schroedinger function for this instant and show that in view of the 
incoherency of the superposition all values of x are at that instant equally 
likely. 


⁶ To say that f is even means that f(−s) = f(s); to say that g is odd means
that g(−u) = −g(u); and so forth.


8 . Show that if the Schroedinger function at a certain instant is the 

Kennard packet 4*®, then the amplitude of the distribution-in-p at that 
instant is exp [ — (p — pa)^/4Ap — ipxa/h]j where 71 = 7 + 

and Ap = ?i/2A» . Verify that is the distribution-in-p corresponding to 
the i/'-function 4“^. 

9. Show that the distribution-in-p for the state 19^1 is 8 ^^. 

10. Show that the condition that the Schroedinger function at a given
instant be (4) is not only sufficient but necessary in order that the momen-
tum of the particle at that instant be certainly p'.

11. Derive the transformation formula $(x\,|\ ) = h^{-1/2}\int_{-\infty}^{\infty} (p\,|\ )\,e^{ipx/\hbar}\,dp$.


28. The Free Particle in One Dimension 


The only specific dynamical system that we have considered so
far is the linear harmonic oscillator; the next one will be the free
particle in one dimension. If a particle of mass m moves along
the x-axis without being subjected to any forces, the Hamiltonian
function is $H = \frac{1}{2m}p^2 + V_0$, where $V_0$ is an arbitrary constant
that can without loss of generality be set equal to zero. The
operator associated in the Schroedinger scheme with the resulting
H is

$$-\frac{1}{\kappa}\frac{\partial^2}{\partial x^2}, \tag{1}$$

where, as before, $\kappa = 2m/\hbar^2$. We shall, for brevity, often call the
operator associated with the Hamiltonian of a given system the
Hamiltonian operator of the system, or simply the Hamiltonian.

To compute the possible values of the energy of a free particle,
we must find the eigenvalues of (1), that is, the values of the
number E for which the equation

$$-\frac{1}{\kappa}\frac{\partial^2}{\partial x^2}\psi = E\psi, \tag{2}$$

the first Schroedinger equation for the free particle, has well-
behaved solutions. Equation (2) is of the form 5* and has well-
behaved solutions if and only if

$$E \ge 0, \tag{3}$$

so that the eigenvalue spectrum of our Hamiltonian consists of all
positive numbers, zero included. We conclude that (when $V_0 = 0$)
any positive value of the energy, zero included, is a possible value
of the energy of a free particle, and that no other values are
possible. This result is in agreement with the corresponding
result of the classical theory of a free particle: the energy of a free
particle is not quantized.

The eigenfunctions of H belonging to the eigenvalue E have the
form $a\exp(i\sqrt{\kappa E}\,x) + b\exp(-i\sqrt{\kappa E}\,x)$, that is,

$$\psi = a\,e^{i\sqrt{2mE}\,x/\hbar} + b\,e^{-i\sqrt{2mE}\,x/\hbar}, \tag{4}$$

where a and b are independent of x, but are otherwise arbitrary;
(4) is the general solution of (2), except when E = 0.

To compute the properly time-dependent Schroedinger function
for a free particle of energy E, we substitute (4) into the second
Schroedinger equation (which in the case of an energy state
reduces to $E\psi = i\hbar\,\partial\psi/\partial t$), and find that this function, to be
denoted by $\psi_E$, is

$$\psi_E = A\,e^{i\sqrt{2mE}\,x/\hbar}\,e^{-iEt/\hbar} + B\,e^{-i\sqrt{2mE}\,x/\hbar}\,e^{-iEt/\hbar}, \tag{5}$$

where A and B are independent of both x and t, but are otherwise
arbitrary. Every nonvanishing positive value of E is a doubly
degenerate eigenvalue of H, since the two special eigenfunctions

$$e^{i\sqrt{2mE}\,x/\hbar}\,e^{-iEt/\hbar}, \tag{6}$$

$$e^{-i\sqrt{2mE}\,x/\hbar}\,e^{-iEt/\hbar}, \tag{7}$$

of which (5) is a linear combination, are linearly independent when
$E \ne 0$. The energy spectrum of a free particle is continuous and
degenerate, and the eigenfunctions of H are not quadratically
integrable; for these reasons, the theory of a free particle is in
some respects more involved than the theory of a linear harmonic
oscillator.
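That any combination of the two exponentials is an energy eigenfunction can be checked with a finite-difference sketch. The assumptions are labeled in the code: units with ħ = 1 and m = 1, an arbitrary energy and arbitrary complex coefficients, and a second difference in place of the second derivative. The helper names are hypothetical.

```python
import cmath
import math

HBAR = 1.0   # assumption: units with hbar = 1
MASS = 1.0   # assumption: unit mass


def energy_state(x, energy, a, b):
    """a exp(i sqrt(2mE) x / hbar) + b exp(-i sqrt(2mE) x / hbar)."""
    k = math.sqrt(2.0 * MASS * energy) / HBAR
    return a * cmath.exp(1j * k * x) + b * cmath.exp(-1j * k * x)


def hamiltonian_applied(f, x, step=1e-3):
    """Approximate -(hbar^2 / 2m) d^2 f / dx^2 at x with a second difference."""
    second = (f(x + step) - 2.0 * f(x) + f(x - step)) / step ** 2
    return -(HBAR ** 2 / (2.0 * MASS)) * second


E = 2.0                    # illustrative energy
a, b = 0.7, 0.2 - 0.5j     # arbitrary coefficients
x = 0.4
lhs = hamiltonian_applied(lambda s: energy_state(s, E, a, b), x)
rhs = E * energy_state(x, E, a, b)
residual = abs(lhs - rhs)  # small for every choice of a and b
print(residual)
```

Because the residual is small whatever a and b are, the single number E does not fix the state, which is the degeneracy described above.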

Comparison of (6) and (7) with 4^{27} shows that each of these
functions, besides being an eigenfunction of our H, is simultane-
ously an eigenfunction of p, and that (6) belongs to the eigenvalue
$\sqrt{2mE}$ of p, while (7) belongs to the eigenvalue $-\sqrt{2mE}$ of p;
the values $\pm\sqrt{2mE}$ of p (corresponding, respectively, to right-
ward and leftward motion) are just the possible values of p
of a classical free particle of energy E. The general energy
state (5) of a free particle is thus a superposition of two momen-
tum states; in fact, the possible results of a precise measurement
of the momentum of a free particle whose state is specified by the
Schroedinger function (5) are $\sqrt{2mE}$ and $-\sqrt{2mE}$, the respective
relative probabilities of these results at any time being $|Ae^{-iEt/\hbar}|^2$
and $|Be^{-iEt/\hbar}|^2$, that is, $|A|^2$ and $|B|^2$. The distribution-in-x,
$\bar\psi_E\psi_E$, for the state (5) is stationary and has the general form
shown in Fig.


We saw in §18 that the knowledge of the energy of a linear
harmonic oscillator with certainty leaves the Schroedinger function
specifying the state of the oscillator undetermined only to the
extent of an arbitrary multiplicative constant, and that con-
sequently the condition that the energy of the oscillator is a
certainty constitutes a maximal set of conditions on the oscillator:
if the statement that the energy of the oscillator is a certainty is
made, no relevant information, not already implied in this state-
ment,^ can be added to it. In the case of a free particle, the
situation is different on account of the degeneracy of the energy
spectrum; the knowledge that the energy of a free particle is a
certainty does imply that the Schroedinger function is (5), yet it
leaves both A and B in (5) arbitrary. This means that the con-
dition that the energy of a free particle is certainly E does not
constitute a maximal set of conditions when E > 0, and can be
supplemented by further information concerning the particle,

cert^y E E> 0 may be accompanied by the statement that 

+ v'SSi in this 

that the momentum rfTtreelLttde'S a ** 

set, sin^ it the J 

n.oem^„»rt.inh,_p.. The «m.ction (8) nan be written as 

Sribs,’

’ Proc. Roy. Soc., 117A, 2SS (1927).

We consider next how the Schroedinger
function $\psi$ describing the state of a free particle at the instant t
can be found if it is known to be $\psi^0$ at t = 0; in the field-free case,
the second Schroedinger equation 1^® has the form

$$\frac{i\hbar}{2m}\frac{\partial^2}{\partial x^2}\psi = \frac{\partial}{\partial t}\psi, \tag{8}$$
so that, stated in mathematical terms, our problem is to find that
solution of (8) which at t = 0 reduces to the given $\psi^0$. If $\psi^0$ is an
eigenfunction of H belonging to the eigenvalue E, then, as has
been shown, $\psi$ is simply $\psi^0 e^{-iEt/\hbar}$; but, if the energy of the particle
is not a certainty, $\psi$ is more complicated.

In §16 we found that the solution of the corresponding problem
for the case of an oscillator can be obtained by first resolving $\psi^0$ in
terms of the eigenfunctions of $H$ referring to $t = 0$, and then
attaching the appropriate time factor to each term in the
summation. Essentially the same procedure can be used in the
present case, except that: (a) since to all but one eigenvalue of $H$
there belong two linearly independent eigenfunctions, the
simultaneous eigenfunctions of $H$ and $p$, rather than the eigenfunctions
of $H$ alone, are to be used in the expansions; and (b) integrals
must be employed instead of sums. The functions that will play a
fundamental part in our calculation are thus the functions (6) and
(7), which all have, apart from a multiplicative constant, the form

$$\psi_p = e^{ipx/\hbar}\,e^{-ip^2t/2m\hbar}, \tag{9}$$

where $p$ is the numerical value of the momentum (positive,
negative, or zero) and where $\psi_p$ is used for brevity instead of the
more explicit symbol $\psi_{E,p}$. For $t = 0$, the functions (9) become

$$\psi_p^0 = e^{ipx/\hbar}. \tag{10}$$

We now proceed formally in a manner parallel to that of §16.
First, we expand the given $\psi^0$ in terms of the (continuous) set (10):

$$\psi^0 = \int c_p\,\psi_p^0\,dp; \tag{11}$$

Equation (11) is analogous to $13^{16}$. Secondly, we substitute $\psi_p$,
given by (9), for $\psi_p^0$ in (11), and denote the resulting function of
$x$ and $t$ by $\psi$:

$$\psi = \int c_p\,\psi_p\,dp; \tag{12}$$

Equation (12) is analogous to $14^{16}$. More explicitly, according to



148 


THE SCHROEDINGER METHOD


§28 


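(10), 30^’, and 31^, Equation (11) is

$$\psi^0 = \int_{-\infty}^{\infty}\Big[\frac{1}{h}\int_{-\infty}^{\infty}\psi^0(\xi)\,e^{-ip\xi/\hbar}\,d\xi\Big]e^{ipx/\hbar}\,dp, \tag{13}$$

while (12) is

$$\psi = \int_{-\infty}^{\infty}\Big[\frac{1}{h}\int_{-\infty}^{\infty}\psi^0(\xi)\,e^{-ip\xi/\hbar}\,d\xi\Big]e^{-ip^2t/2m\hbar}\,e^{ipx/\hbar}\,dp. \tag{14}$$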


At $t = 0$, the right sides of (14) and (13) are the same, and
consequently it remains only to show that (14) satisfies (8).
Now, differentiation of (14) once with respect to $t$ under the
integral sign introduces the factor $-ip^2/2m\hbar$ in the integrand of
the integral with respect to $p$; differentiation⁸ of (14) twice with
respect to $x$ under the integral sign introduces the factor $(ip/\hbar)^2$
in the integrand of the integral with respect to $p$; and, since
$(i\hbar/2m)(ip/\hbar)^2 = -ip^2/2m\hbar$, (14) is a solution of (8). Our
problem has been solved: if the Schroedinger function describing
the state of a free particle is $\psi^0$ at $t = 0$, then it is (14) at the
instant $t$.

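The case when $\psi^0$ is a discrete superposition of momentum
states, that is, has the form

$$\psi^0 = c'e^{ip'x/\hbar} + c''e^{ip''x/\hbar} + \cdots, \tag{15}$$

is a special case of the continuous superpositions considered
above; but it can be treated without reference to (14), since we
can readily verify that the function

$$\psi = c'e^{ip'x/\hbar}e^{-iE't/\hbar} + c''e^{ip''x/\hbar}e^{-iE''t/\hbar} + \cdots, \tag{16}$$

where $E' = (p')^2/2m$, $E'' = (p'')^2/2m$, and so forth, satisfies (8)
and reduces to (15) at $t = 0$.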
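The verification mentioned above can also be sketched numerically. The following Python fragment is only an illustration under stated assumptions: units with $\hbar = m = 1$, a hypothetical three-term superposition, and central finite differences in place of exact derivatives. The names `psi`, `residual`, and `example_terms` are introduced here and do not come from the text.

```python
import cmath

HBAR = 1.0   # assumed units: hbar = 1
MASS = 1.0   # assumed units: m = 1

def psi(x, t, terms):
    """Discrete superposition of momentum states, as in (16).

    `terms` is a list of (c, p) pairs; each term contributes
    c * exp(i p x / hbar) * exp(-i E t / hbar) with E = p**2 / (2 m).
    """
    total = 0j
    for c, p in terms:
        energy = p * p / (2.0 * MASS)
        total += c * cmath.exp(1j * (p * x - energy * t) / HBAR)
    return total

def residual(x, t, terms, d=1e-3):
    """Finite-difference estimate of (i hbar / 2m) psi_xx - psi_t, cf. (8)."""
    psi_xx = (psi(x + d, t, terms) - 2 * psi(x, t, terms)
              + psi(x - d, t, terms)) / d ** 2
    psi_t = (psi(x, t + d, terms) - psi(x, t - d, terms)) / (2 * d)
    return 1j * HBAR / (2 * MASS) * psi_xx - psi_t

# Hypothetical example: momenta 0, +2, -2, chosen so that psi(x, 0) = sin(x)**2.
example_terms = [(0.5, 0.0), (-0.25, 2.0), (-0.25, -2.0)]
print(abs(residual(0.3, 0.7, example_terms)))  # small, limited by the step d
```

The residual is not exactly zero because the derivatives are approximated; it shrinks as the step `d` is reduced, until rounding error takes over.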

Constancy of p. According to classical mechanics, the
momentum of a free particle is a constant of motion, so that if it is
certainly $p$ (that is, if it certainly has the numerical value $p$)
at the instant $t = 0$, then it will continue to be certainly $p$ for all
time; or, if the momentum of a classical free particle at $t = 0$
is not a certainty, then the distribution-in-$p$ at any subsequent
time is the same as it is at $t = 0$. The situation is similar in
quantum mechanics.

In the first place, if the momentum of a free particle at $t = 0$


⁸ Note that the bracketed part of (14) is independent of $x$.





is certainly $p$, then the Schroedinger function at $t = 0$ is a
constant multiple of (10), and the time-dependent Schroedinger
function is a constant multiple of (9); and, since (9) is at any time
an eigenfunction of momentum belonging to the eigenvalue $p$,
it follows that, if the momentum of a free particle is certainly
$p$ at $t = 0$, it will continue to be certainly $p$ for all time.

Secondly, if the momentum of a free particle at $t = 0$ is not a
certainty, so that the Schroedinger function for the instant $t$ has
the general form (14), then by inversion (Exercise 2^^) we get

$$\Big[\frac{1}{h}\int_{-\infty}^{\infty}\psi^0(\xi)\,e^{-ip\xi/\hbar}\,d\xi\Big]e^{-ip^2t/2m\hbar} = \frac{1}{h}\int_{-\infty}^{\infty}\psi\,e^{-ipx/\hbar}\,dx. \tag{17}$$

Now, according to the transformation formula 58^^, the right side
of (17) is $h^{-1/2}$ times the amplitude, $(p\,|\ )$, of the distribution-in-$p$
at the time $t$, while the bracket on the left side of (17) is $h^{-1/2}$ times
the amplitude, $(p\,|\ )^0$, of the distribution-in-$p$ at the instant
$t = 0$; consequently

$$(p\,|\ ) = (p\,|\ )^0\,e^{-ip^2t/2m\hbar}, \tag{18}$$

so that the amplitude of the distribution-in-$p$ for any state of a
free particle depends on $t$ in a rather simple way. The
distribution-in-$p$ at the time $t$ is

$$(\ |\,p)(p\,|\ ) = (\ |\,p)^0(p\,|\ )^0, \tag{19}$$

and is always the same as at $t = 0$. The stationary character
of the distribution-in-$p$ for an arbitrary state of a free particle,
expected in the light of classical analogy, has thus been proved.
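For a discrete superposition of momentum states the same conclusion can be illustrated in a few lines of Python. This is a sketch under assumptions, not part of the text's argument: units with $\hbar = m = 1$, and relative probabilities taken proportional to the squared moduli of the coefficients, each of which only acquires a phase factor as in (18). The function name `momentum_weights` is hypothetical.

```python
import cmath

def momentum_weights(coeffs, energies, t, hbar=1.0):
    """Relative probabilities of the momenta in a discrete superposition at time t.

    Each coefficient c_j is multiplied by exp(-i E_j t / hbar); the weights are
    the squared moduli, normalized so that they add up to 1.
    """
    amps = [c * cmath.exp(-1j * e * t / hbar) for c, e in zip(coeffs, energies)]
    norm = sum(abs(a) ** 2 for a in amps)
    return [abs(a) ** 2 / norm for a in amps]

# Hypothetical example: psi0 = sin(x)**2 = 1/2 - (1/4)e^{2ix} - (1/4)e^{-2ix},
# i.e. momenta 0, +2, -2 with energies 0, 2, 2 in units hbar = m = 1.
coeffs = [0.5, -0.25, -0.25]
energies = [0.0, 2.0, 2.0]
print(momentum_weights(coeffs, energies, 0.0))
print(momentum_weights(coeffs, energies, 3.7))
```

Both calls give the same weights, because the time factors have unit modulus.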

Exercises 

1. Show that the distribution-in-j& for any state of a free particle is 
stationary. 

2. In the text we let the potential $V_0$, describing the force-free case,
be zero.⁹ Consider the case $V_0 = \text{constant} \neq 0$.

3. Set up the Schroedinger function which implies that the energy of a
free particle is certainly zero. Consider its time dependence, and the
information it contains concerning the momentum.

4. Show that if the state of a free particle is specified at $t = 0$ by the
Schroedinger function $\psi^0 = A\sin^2 kx$, then the time-dependent $\psi$-function is

$$\psi = -\tfrac12 A\cos 2kx\;e^{-iE_kt/\hbar} + \tfrac12 A, \tag{20}$$

where $E_k = 2k^2\hbar^2/m$.


⁹ The condition $V_0 = 0$ is implied also in the exercises below.





5. Find by inspection the distribution-in-$p$ for the state (20); then
show that the possible values of the energy of a free particle in the state
(20) are $E_k$ and 0, and that the respective probabilities of these values
are $\tfrac13$ and $\tfrac23$.

6. Let the state of a free particle be specified at $t = 0$ by the $\psi$-function

(a— 3Jo)® , ..PO® 



that is, by a Kennard packet. Show that the time-dependent $\psi$ is then




exp 

2^ TT^ Va* + iAvt 


exp 


f [a; (gp + vot)]^ 



( 22 ) 


where v = p/m. Note the formula^® 
^-(a«*+/9a+7) g»(«s*+«8+r) 


/- 


Y 4w+»-r J"“’L' 


tf+t 


( ffl - t»)a - I 

4(a» + «») J- 

(23) 


Show that the distribution-in-$x$ for the state (22) is $13^{12}$. Compare the
present problem with that which was considered classically in §12 and
which led to $13^{12}$; point out a condition under which the above-mentioned
problem of §12 cannot be translated into the mathematical terms of
quantum mechanics and is consequently meaningless from the
quantum-mechanical standpoint.

7. Show that for the state (22) av^ = — Po + ~ aJ . 

2m 2m 

8. The theorem that the distribution-in-$p$ for any state of a free particle
is stationary was proved in the text with the help of the transformation
formula 58^^. Prove this theorem for the case of states specified by
normalizable $\psi$'s by computing the time derivative of $\mathrm{av}\,p^k$, and without
reference to 58^^. Note that in the field-free case, according to (8),
$\partial\psi/\partial t = i\hbar\psi''/2m$, and $\partial\bar\psi/\partial t = -i\hbar\bar\psi''/2m$.


29. One-Dimensional Rectangular Potential Wells 

The energy spectrum of a linear harmonic oscillator is discrete, 
while that of a free particle is continuous. To give an example 
of an energy spectrum that contains a discrete part and also a 


¹⁰ See integrals 6 and 7, Table 269, D. Bierens de Haan, Nouvelles Tables
d'Intégrales Définies, Leyden, 1867.





continuous part we shall now consider the motion of a particle in
the field of the rectangular potential well of Fig. 9. The energy
diagram of this system is shown in Fig. 28 for the case when the
total energy $E$ of the particle is less than the depth $V_0$ of the
well, and in Fig. 33 for the case when $E > V_0$.

Fig. 28. A potential well. Energy diagram for the case $0 < E < V_0$.

The Schroedinger equation is now

$$\psi'' + \kappa[E - V(x)]\psi = 0, \tag{1}$$

where

$$V(x) = \begin{cases} V_0 > 0 & \text{for } x < -a \\ 0 & \text{for } -a < x < a \\ V_0 & \text{for } a < x. \end{cases} \tag{2}$$


To find the possible energies of our particle, we must compute
the values of $E$ for which (1), with $V(x)$ given by (2), has
well-behaved solutions; and in view of the discontinuities in $V(x)$ we
proceed as follows. We subdivide the infinite interval into the
left region ($x < -a$), the central region ($-a < x < a$), and the
right region ($a < x$); writing $\psi$ in the form

$$\psi = \begin{cases} \psi_l & \text{for } x < -a \\ \psi_c & \text{for } -a < x < a \\ \psi_r & \text{for } a < x, \end{cases} \tag{3}$$

we now find that, if (3) is to be a solution of (1), then the three
pieces into which we have divided $\psi$ must satisfy, respectively,
the simple equations

$$\psi_l'' + \kappa(E - V_0)\psi_l = 0, \qquad \psi_c'' + \kappa E\psi_c = 0, \qquad \psi_r'' + \kappa(E - V_0)\psi_r = 0; \tag{4}$$

next we find the respective general solutions of Equations (4);
and finally we determine the circumstances under which $\psi_l$, $\psi_c$,
and $\psi_r$, when pieced together, will yield a well-behaved $\psi$.
We shall carry out this procedure in detail for the case

$$0 < E < V_0, \tag{5}$$





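to which Fig. 28 specifically refers. In terms of the abbreviations

$$\alpha = \sqrt{\kappa E}, \qquad \beta = \sqrt{\kappa(V_0 - E)}, \tag{6}$$

the respective general solutions of Equations (4) are

$$\psi_l = B_1e^{\beta x} + B_2e^{-\beta x}, \qquad \psi_c = A_1e^{i\alpha x} + A_2e^{-i\alpha x}, \qquad \psi_r = C_1e^{\beta x} + C_2e^{-\beta x}, \tag{7}$$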

where the $A$'s, $B$'s, and $C$'s are arbitrary constants. In view
of (5), $\beta$ is positive; hence the term $B_2e^{-\beta x}$ in $\psi_l$ misbehaves when
$x \to -\infty$, and we must set $B_2 = 0$ in order that (3) may behave
properly at $-\infty$; the term $B_1e^{\beta x}$ in $\psi_l$ misbehaves when $x \to \infty$,
but, since our $\psi$ is equal to $\psi_l$ only in the left region, this does
not imply a violation of the boundary conditions by $\psi$. Similarly,
the boundary condition at $+\infty$ requires that $C_1 = 0$. The
solution

$$\psi = \begin{cases} \psi_l = B_1e^{\beta x} \\ \psi_c = A_1e^{i\alpha x} + A_2e^{-i\alpha x} \\ \psi_r = C_2e^{-\beta x} \end{cases} \tag{8}$$

of (1) thus satisfies the boundary conditions for an arbitrary
choice of $B_1$, $A_1$, $A_2$, and $C_2$. The general character of (8) is
indicated in Fig. 29, which is drawn on the assumption that $\psi$ is
real and that $B_1$ and $C_2$ are positive.

Fig. 29. A function of the form (8). This function is ill-behaved;
well-behaved functions of the form (8) are shown in Fig. 32.

We turn next to our standard continuity conditions, which require
that $\psi$ and $\psi'$ be continuous throughout the infinite region. Each of
the three pieces of $\psi$ in (8), when taken by itself, is continuous and
has a continuous derivative, and hence we need to consider only the
points $-a$ and $a$ at which the pieces join. Now, $\psi$ will be continuous
at $-a$ if and only if $\psi_l(-a) = \psi_c(-a)$; similarly, $\psi'$ will be
continuous at $-a$ if and only if $\psi_l'(-a) = \psi_c'(-a)$. Analogous
conditions must hold at $a$, so that our $\psi$ will satisfy the continuity
conditions if and only if

$$\psi_l(-a) = \psi_c(-a), \qquad \psi_l'(-a) = \psi_c'(-a), \tag{9}$$

$$\psi_r(a) = \psi_c(a), \qquad \psi_r'(a) = \psi_c'(a). \tag{10}$$




In trying to subject (8) to the conditions (9) and (10), we might
proceed as follows: Taking $\psi_l$ in (8) as it stands, we adjust $\psi_c$
so as to satisfy the continuity conditions at $-a$; this can be done
without difficulty because $\psi_c$ contains two adjustable constants,
and there are just two conditions to satisfy. After this, no
arbitrariness will remain in $\psi_c$, and the values of $\psi_c$ and $\psi_c'$ at $a$
will become fixed. Next we try to satisfy the continuity
conditions at $a$ by adjusting $\psi_r$; but here we encounter a complication,
because $\psi_r$ must be made to satisfy the two conditions (10),
although it contains only one adjustable constant, namely, $C_2$.
We may therefore expect that in general it is impossible to adjust
the coefficients in (8) so as to satisfy Equations (9) and (10),
and that exceptional cases in which this can be done can arise
only if $\alpha$, $\beta$, and $a$ are related in some special way.

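To carry out the calculations explicitly, we substitute $\psi_l$ and
$\psi_c$, given in (8), into (9), getting

$$B_1e^{-\beta a} = A_1e^{-i\alpha a} + A_2e^{i\alpha a}, \tag{11}$$

$$B_1\beta e^{-\beta a} = A_1i\alpha e^{-i\alpha a} - A_2i\alpha e^{i\alpha a}. \tag{12}$$

Elimination of $B_1$ yields the relation between $A_1$ and $A_2$ that is
required for the proper joining of $\psi_l$ and $\psi_c$:

$$A_2 = -\frac{\beta - i\alpha}{\beta + i\alpha}\,e^{-2i\alpha a}A_1. \tag{13}$$

Similarly, substitution of $\psi_r$ and $\psi_c$ into (10) yields

$$C_2e^{-\beta a} = A_1e^{i\alpha a} + A_2e^{-i\alpha a}, \tag{14}$$

$$-C_2\beta e^{-\beta a} = A_1i\alpha e^{i\alpha a} - A_2i\alpha e^{-i\alpha a}, \tag{15}$$

and eliminating $C_2$ we get the relation between $A_1$ and $A_2$ that is
required for the proper joining of $\psi_c$ and $\psi_r$:

$$A_2 = -\frac{\beta + i\alpha}{\beta - i\alpha}\,e^{2i\alpha a}A_1. \tag{16}$$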

Now, Equations (13) and (16) are in general inconsistent, and
hence it is impossible in general to piece the three parts of (8)
together so that the resulting $\psi$ satisfies the continuity conditions.
In other words, the Schroedinger equation for our particle does
not possess well-behaved solutions for an arbitrary value of $E$
lying between 0 and $V_0$; and we draw the physical conclusion that
not every energy value lying between 0 and $V_0$ is a possible value
of the energy of a particle moving in the field of Fig. 28.

It remains to investigate the circumstances under which (13)
and (16) are consistent, that is, under which

$$\frac{\beta + i\alpha}{\beta - i\alpha}\,e^{2i\alpha a} = \frac{\beta - i\alpha}{\beta + i\alpha}\,e^{-2i\alpha a}. \tag{17}$$

Equation (17) is a condition on the quantities $\alpha$, $\beta$, and $a$, and
not on the arbitrary constants in (7); if $a$, $V_0$, and $\kappa$ are fixed, as
they are in our problem, (17) is a condition on $E$. Elimination
of the complex exponentials in (17) yields

$$(\beta^2 - \alpha^2)\sin 2\alpha a = -2\alpha\beta\cos 2\alpha a, \tag{18}$$

so that, in view of (6), Equations (13) and (16) are consistent if
and only if

$$\tan 2a\sqrt{\kappa E} = \frac{2\sqrt{E(V_0 - E)}}{2E - V_0}. \tag{19}$$

Thus the values of $E$, lying between 0 and $V_0$, for which our
Schroedinger equation possesses well-behaved solutions, are the
roots of the transcendental equation (19) lying between 0 and $V_0$.
These roots, which turn out to be real, discrete, and finite in
number, are the energy levels of our dynamical system lying
between 0 and $V_0$; they will be labeled $E_1, E_2, E_3$, and so on,
in such a way that $E_1 < E_2 < E_3 < \cdots$.
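The roots of the level equation can be located numerically. The Python sketch below is an illustration under assumptions, not the method of the text: it works with (18) rather than (19), since (18) carries the same condition without the poles of the tangent; it uses the dimensionless quantities $\xi = \alpha a$ and $P = V_0a^2\kappa$; and it finds roots by scanning for sign changes and bisecting. The names `g` and `well_levels` are hypothetical.

```python
import math

def g(xi, P):
    """Left side minus right side of (18), multiplied by a**2.

    xi = alpha * a, eta = beta * a, and xi**2 + eta**2 = P = V0 * a**2 * kappa.
    """
    eta = math.sqrt(max(P - xi * xi, 0.0))
    return (eta * eta - xi * xi) * math.sin(2 * xi) + 2 * xi * eta * math.cos(2 * xi)

def well_levels(P, steps=20000):
    """Return E / V0 for every root of (18) found with 0 < E < V0."""
    top = math.sqrt(P)
    levels = []
    x_prev = top / steps
    g_prev = g(x_prev, P)
    for i in range(2, steps):
        x = top * i / steps
        g_cur = g(x, P)
        if g_prev * g_cur < 0:
            # A sign change brackets a root; refine it by bisection.
            lo, hi, g_lo = x_prev, x, g_prev
            for _ in range(60):
                mid = 0.5 * (lo + hi)
                g_mid = g(mid, P)
                if g_lo * g_mid <= 0:
                    hi = mid
                else:
                    lo, g_lo = mid, g_mid
            xi = 0.5 * (lo + hi)
            levels.append(xi * xi / P)
        x_prev, g_prev = x, g_cur
    return levels

print(well_levels(36.0))  # the case V0 * a**2 * kappa = 36
```

The scan can miss a root only if two roots fall inside one scanning step, so `steps` should be increased for very deep wells.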

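To complete the discussion of the case $0 < E < V_0$, we shall
now find the explicit form of the Schroedinger function $\psi_n$
belonging to an energy value $E_n$ lying between 0 and $V_0$, the
calculation being made on the assumption that $E_n$ satisfies (19), so
that (17) holds, and (13) and (16) are both true. The product
of the left sides of (13) and (16) is $A_2^2$, and that of the right
sides is $A_1^2$; hence $A_2 = \pm A_1$, and (16) goes over into

$$\frac{\beta + i\alpha}{\beta - i\alpha}\,e^{2i\alpha a} = \mp 1. \tag{20}$$

When $B_1$ and $C_2$ are evaluated in terms of $A_1$ [note that (11) and
(12) are now equivalent, and so are (14) and (15)],
the solution (8) takes the form

$$\psi_n = \begin{cases} \pm A\,(e^{i\alpha_na} \pm e^{-i\alpha_na})\,e^{\beta_n(a+x)} & x < -a \\ A\,(e^{i\alpha_nx} \pm e^{-i\alpha_nx}) & -a < x < a \\ A\,(e^{i\alpha_na} \pm e^{-i\alpha_na})\,e^{\beta_n(a-x)} & a < x, \end{cases} \tag{21}$$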



where

$$\alpha_n = \sqrt{\kappa E_n}, \qquad \beta_n = \sqrt{\kappa(V_0 - E_n)}, \tag{22}$$

and where $A$ is the factor, independent of $x$ but otherwise
arbitrary, previously denoted by $A_1$. To determine whether the
upper or the lower signs in (21) are to be used in $\psi_n$, we may
substitute $E_n$ into (20) and thus determine whether the upper or
the lower sign is appropriate in (20).

To make $\psi_n$ properly time dependent, we let $A$ depend on $t$
through the factor $e^{-iE_nt/\hbar}$, for then $H\psi_n = E_n\psi_n = i\hbar\,\partial\psi_n/\partial t$ and $\psi_n$
satisfies the second Schroedinger equation $1^{16}$.

Besides the discrete energy spectrum lying between 0 and $V_0$,
our system has a continuous spectrum consisting of all energies
greater than $V_0$ (Exercise 6); if $E = V_0$ is a solution of (19),
there is also an energy level at $E = V_0$ (Exercise 7). Each level
belonging to the continuous spectrum is doubly degenerate,
while, as we have already seen, each level of the discrete spectrum
is nondegenerate. The system has no energy levels besides those
enumerated above.

Exercises 

1. Show that every $\psi$-function (21) is either even or odd in $x$, and can
be made real by a proper choice of $A$.

2. Show that the choice $A = N_n = \tfrac12(a + 1/\beta_n)^{-1/2}e^{i\gamma}$, where $\gamma$ is an arbitrary
real constant, normalizes (21) to unity.

3. The substitution $s = 2a\sqrt{\kappa E}$ sends (19) into
$\tan s = s(4V_0a^2\kappa - s^2)^{1/2}(s^2 - 2V_0a^2\kappa)^{-1}$; graphs of the two sides of
this equation for the case $V_0a^2\kappa = 36$ are given in Fig. 30. Use Fig. 30
to show that, in the special case $V_0a^2\kappa = 36$, the system¹¹ has four
energy levels lying between 0 and $V_0$, and that these are, approximately,
$E_1 = .05V_0$, $E_2 = .20V_0$, $E_3 = .44V_0$, and $E_4 = .74V_0$. These discrete
levels are shown in the lower part of Fig. 31; the corresponding normalized
$\psi$-functions are shown in Fig. 32.

4. Verify that, in terms of $\xi = a\sqrt{\kappa E}$, Equation (20), taken with the
upper sign, can be written as

$$\tan\xi = (V_0a^2\kappa - \xi^2)^{1/2}\,\xi^{-1}, \tag{23}$$

and, taken with the lower sign, as

$$\tan\xi = -\xi\,(V_0a^2\kappa - \xi^2)^{-1/2}. \tag{24}$$

Draw rough graphs of $\tan\xi$ and of the right sides of (23) and (24), regarded
as functions of $\xi$, and show that the energy levels of the system¹¹ lying
between 0 and $V_0$ are spaced as follows:

$$0 < a\sqrt{\kappa E_1} < \tfrac12\pi < a\sqrt{\kappa E_2} < \tfrac22\pi < a\sqrt{\kappa E_3} < \tfrac32\pi < \cdots < a\sqrt{\kappa V_0}, \tag{25}$$

¹¹ That is, a particle of mass $m$ in the field of the one-dimensional
rectangular potential well of Fig. 9.



Fig. 30. Graph for Exercise 3.

Fig. 31. The energy spectrum of the well of Exercise 3. In this case, the
value $E = V_0$ is not included in the spectrum.

Fig. 32. Instantaneous $\psi$-functions for the discrete energy states of the
well of Exercise 3. The heavy horizontal lines extend on the $x$-axes from
$x = -a$ to $x = a$.


and that the system has at least one level between 0 and $V_0$. Under what
condition are there just ten levels between 0 and $V_0$?

5. Show that (21) is even in $x$ if $n$ is odd, and odd in $x$ if $n$ is even; show
also that (21) has altogether $n + 1$ nodes: one at $-\infty$, one at $\infty$, and $n - 1$
between $-a$ and $a$.

6. Consider the case $E > V_0$, shown in Fig. 33. Note that $\beta$ in (6)
is now a pure imaginary, so that the terms $B_2e^{-\beta x}$ and $C_1e^{\beta x}$ in (7)
are no longer ill-behaved in their respective regions; show that
consequently (as indicated in Fig. 31) any energy greater than $V_0$ is a
possible energy of the system, and verify that the $\psi$-functions
belonging to the energy value $E$, $E > V_0$, can be written in the form
{Ast + EsOe-*®* + {Asi + Bsa)e"«* (26) 

Ur = 4- (Asi 4- 



-a 0.x 

^ Fig. 33. A potential well. Energy 
diagram for the case E > Fo . 






where A and B are arbitrary constants, and where a = y/lcE^ ai = 

^/k[E - Fo), Si = (a - S 2 = (o!+ ai)(2a)~'e»^“i“«5‘‘, 

8z = (2a£ai)[2o£ai cos 2aa — i{cL^ H- a!) sin 2aale^*“i^ and 54 = i{2aai)~^ 
(■a® — ai) sin 2aa, 

7. Show that $E = V_0$ is a possible energy of the system¹¹ if and only if
$E = V_0$ is a root of (19); and that the corresponding $\psi$-function contains
only one arbitrary constant, and can be obtained by letting $E = V_0$ in
(21); is this $\psi$-function normalizable?

8. Show that energies less than or equal to zero are not possible energies
of the system.¹¹

9. Consider a particle moving in one dimension in the field of the
rectangular potential barrier

$$V(x) = \begin{cases} 0 & x < -\tfrac12 l \\ V_0 > 0 & -\tfrac12 l < x < \tfrac12 l \\ 0 & \tfrac12 l < x \end{cases} \tag{27}$$

drawn in Fig. 34. Show that any energy greater than 0 is a possible energy
of the system. Verify that the Schroedinger functions describing a state
of energy $E$ have the following forms: For $0 < E < V_0$,

Ui « + (Ac2 + Bci)e-^^ 

^ = *1 ~ (Aci + + (i4ca 4" Bci)e~^^ (28) 

[^r - (Aci + Bc2)e^«» + 


I T 1 

Vo 

I I I 

-|I -jj ~x 

Fig. 34. The potential barrier (27). 

where A and B are arbitrary constants, where a and jS are given by (6), 
and where ci — 2a/3[2a/3 cosh pi + i(i8* — “*) sinh Ci = 

— ici(2aj3)”^(/3> + «“) sinh |8Z, cs = Ci(2/8)“*03 — and c^ = 

c,(2|8)-i(^ + For E = Vt, 

+ B[(l - - iiaoZe"*'""*] 

^ - iatW + a;)] + Be**”"'!! - iaoW - s)) (29) 

where $A$ and $B$ are arbitrary constants, and $\alpha_0 = \sqrt{\kappa V_0}$. For $E > V_0$,
the $\psi$-functions have the form (28), where $\beta = \sqrt{\kappa(V_0 - E)} = i\sqrt{\kappa(E - V_0)}$.

30. The One-Dimensional Box

A dynamical system of some interest is that consisting of a
particle in a one-dimensional box; the potential describing it is
shown in Fig. 36, a reproduction of Fig. 10. We shall begin
with a box located as in Fig. 35.



Fig. 35. A potential box extending from $x = -a$ to $x = a$.


On account of the highly singular character of our new $V(x)$, we
shall treat it as a limiting case of the well of §29, to be obtained
by letting $V_0$ increase without limit, and shall compute the energy
levels for the box by letting $V_0 \to \infty$ in the expression for the
energy levels for the well. In doing this, we may restrict ourselves
to the discrete levels of the well, because the continuous spectrum
lies above $V_0$ and is of no interest when $V_0 \to \infty$.

When $V_0 \to \infty$ the energy level equation $19^{29}$ for the well goes
over into $\tan 2a\sqrt{\kappa E} = 0$, that is, into $2a\sqrt{\kappa E} = n\pi$, and hence
the energy levels of the boxed particle are

$$E_n = n^2\pi^2/4a^2\kappa = n^2h^2/32ma^2, \qquad n = 1, 2, 3, \ldots. \tag{1}$$

The reason for omitting $n = 0$ will appear presently.
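The passage to the limit can be watched numerically for the lowest level. The Python sketch below is an illustration under assumptions: it solves Equation 23 of §29 in the rearranged form $\xi\sin\xi - \sqrt{P - \xi^2}\cos\xi = 0$, with $\xi = a\sqrt{\kappa E}$ and $P = V_0a^2\kappa$, by bisection on $0 < \xi < \tfrac12\pi$, and compares the root with the box value $\xi = \tfrac12\pi$ that follows from (1) with $n = 1$. The name `lowest_xi` is hypothetical.

```python
import math

def lowest_xi(P, iterations=200):
    """Smallest root xi = a*sqrt(kappa*E) of xi*tan(xi) = sqrt(P - xi**2).

    P = V0 * a**2 * kappa measures the depth of the well.
    """
    def f(xi):
        eta = math.sqrt(max(P - xi * xi, 0.0))
        return xi * math.sin(xi) - eta * math.cos(xi)

    # f is negative at 0 and positive at the upper end of this interval.
    lo, hi = 0.0, min(0.5 * math.pi, math.sqrt(P))
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

for depth in (36.0, 1.0e4, 1.0e8):
    # Ratio of the well value of xi to the box value pi/2.
    print(depth, lowest_xi(depth) / (0.5 * math.pi))
```

The printed ratio approaches 1 as the depth parameter grows, which is the statement that the lowest well level goes over into the $n = 1$ level of the box.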

To obtain the Schroedinger function describing the $n$th
energy state of the particle in the box of Fig. 35, we allow $V_0 \to \infty$
in $21^{29}$. When $V_0 \to \infty$, Equations $22^{29}$ become, in the limit,

$$\alpha_n = n\pi/2a, \qquad \beta_n = \infty; \tag{2}$$


‘ W sisn ia to be 

(0^ < „ Ai-- ““pal n, v«,jal>e» 

$x < -a$, $\psi_r$ vanishes for $x > a$, and $21^{29}$ becomes

$$\psi_n = \begin{cases} 0 & x < -a \\ A\,(e^{in\pi x/2a} \pm e^{-in\pi x/2a}) & -a < x < a \\ 0 & a < x. \end{cases} \tag{3}$$

Fig. 36. A potential box extending from $x = 0$ to $x = l$.

In view of (2) and the result of Exercise $2^{29}$, the $\psi$-function (3) is
normalized if $|A| = \tfrac12a^{-1/2}$; the normalizing factor can of course be
readily evaluated directly. The vanishing of (3) for $n = 0$ is our
reason for excluding $n = 0$ from (1).

The $\psi$-function for the $n$th energy state of a particle in the box of
Fig. 36,




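extending from $x = 0$ to $x = l$, can be obtained from (3) by
writing $\tfrac12l$ for $a$ and $x - \tfrac12l$ for $x$; when normalized and made
properly time-dependent, this function is

$$\psi_n = \begin{cases} 0 & x < 0 \\ \sqrt{2/l}\,\sin\dfrac{n\pi x}{l}\;e^{i(\gamma_n - E_nt/\hbar)} & 0 < x < l \\ 0 & l < x, \end{cases} \tag{4}$$

where $\gamma_n$ is real and independent of $x$ and $t$, but otherwise
arbitrary, and where

$$E_n = \frac{n^2\pi^2}{\kappa l^2} = \frac{n^2h^2}{8ml^2}, \qquad n = 1, 2, 3, \ldots. \tag{5}$$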

The energy spectrum of a particle in a box is thus entirely
discrete and nondegenerate.

Graphs of (4) for $n = 1, 2, 3,$ and 4 for times at which
$\exp i(\gamma_n - E_nt/\hbar) = 1$ are shown in Fig. 37; since $\psi_n$ is in general
complex, it can be represented by a single line in a plane only
for exceptional values of $t$.

Fig. 37. Graphs of (4) for $n = 1, 2, 3,$ and 4 for a particle in a
one-dimensional box.

The function (4) is continuous everywhere in the infinite region;
but its first derivative is discontinuous at the ends of the box, so
that our standard continuity conditions are violated. We adopt
(4) nevertheless, because it is a limit of the function $21^{29}$ that does
satisfy all the requirements, and because from the physical
standpoint it is certainly permissible to regard the box problem as a
limit of the well problem. This situation shows that our elementary
criteria of good behavior are not adequate for the treatment of all
one-dimensional problems; but the exceptional cases are rather rare.

The distribution-in-$x$ for the $n$th quantum state is

$$\bar\psi_n\psi_n = \begin{cases} 0 & x < 0 \\ \dfrac{2}{l}\sin^2\dfrac{n\pi x}{l} & 0 < x < l \\ 0 & l < x \end{cases} \tag{6}$$



and is of course stationary. Equation (6) implies that the
probability of finding the particle outside the box is zero, just as
in the classical theory. Graphs of (6) for $n = 1, 2, 3,$ and 4 are
shown in Fig. 38.

Fig. 38. Distributions-in-$x$ for energy states of a particle in a
one-dimensional box.

Distributions-in-p. Since the $\psi_n$ given by (4) vanishes outside
the box, an integral of the form $\int_{-\infty}^{\infty}\bar\psi_nf(x)\psi_n\,dx$ can be replaced by
$\int_0^l\bar\psi_nf(x)\psi_n\,dx$, and hence the computation of averages of
functions of $x$ is straightforward. The situation is more involved in
the case of averages of functions of $p$. For example, since $\psi_n$
is normalized, we have, according to the expectation formula 2^^,

$$\mathrm{av}_n\,p^k = \int_{-\infty}^{\infty}\bar\psi_n\Big(-i\hbar\frac{\partial}{\partial x}\Big)^k\psi_n\,dx. \tag{7}$$

Now, the derivatives of $\psi_n$ are discontinuous and very irregular
at 0 and at $l$, and consequently limiting processes must be used in
evaluating (7). For example, we may choose some function,
$F(x, c)$, of $x$ and of a parameter $c$, which has $k$ continuous
derivatives when $c \neq 0$ and approaches $\psi_n$ when $c \to 0$, evaluate (7)
with $F(x, c)$ taking the place of $\psi_n$, and let $c \to 0$ in the result.
In particular, we should be prepared to find that this process will
yield a result different from that obtained by writing 0 and $l$
for the limits of integration in (7) and directly carrying out the
integration.

The limiting processes can, however, be circumvented as
follows: The unsatisfactory feature of (7) is that it involves
differentiation of $\psi_n$; now, neither the formula for the amplitude,
$(p\,|\,n)$, of the distribution-in-$p$ for the $n$th quantum state,

(32")
$$(p\,|\,n) = h^{-1/2}\int_{-\infty}^{\infty}\psi_n\,e^{-ipx/\hbar}\,dx, \tag{8}$$

nor the formula expressing $\mathrm{av}_n\,p^k$ in terms of $(p\,|\,n)$,

$$\mathrm{av}_n\,p^k = \int_{-\infty}^{\infty}(n\,|\,p)\,p^k\,(p\,|\,n)\,dp, \tag{9}$$

involves differentiation of $\psi_n$; the evaluation of $\mathrm{av}_n\,p^k$ can
thus be effected without complications by means of (8) and (9).




To be sure, in deriving (8), we used 35” whose validity depends
on that of 26 , while in proving the latter we assumed that
differentiation of $g(x)$ causes no irregularities; but it is perhaps
tolerably clear that, were we to use in (8) the function $F(x, c)$
mentioned above, then calculate (9), and finally let $c \to 0$,
the result would be the same as that obtained by a direct use of
$\psi_n$ in (8), followed by the evaluation of (9).

Omitting the phase factor and the time factor in (4), since
these can be appended to the result later, and remembering that
$\psi_n$ vanishes outside the box, we find by means of (8) that

$$(p\,|\,n) = h^{-1/2}\int_{-\infty}^{\infty}\psi_n\,e^{-ipx/\hbar}\,dx = h^{-1/2}\int_0^l\sqrt{2/l}\,\sin\frac{n\pi x}{l}\,e^{-ipx/\hbar}\,dx$$

$$= \sqrt{\frac{l}{2h}}\left[\frac{1 - \exp(in\pi - ipl/\hbar)}{n\pi - pl/\hbar} + \frac{1 - \exp(-in\pi - ipl/\hbar)}{n\pi + pl/\hbar}\right]. \tag{10}$$

Multiplication of (10) by its complex conjugate yields the
absolute distribution-in-$p$ for the $n$th quantum state

$$(n\,|\,p)(p\,|\,n) = \frac{2n^2\pi l}{\hbar}\cdot\frac{1 - \cos n\pi\,\cos(pl/\hbar)}{(n^2\pi^2 - p^2l^2/\hbar^2)^2}. \tag{11}$$

The value of (11) at $p = \pm n\pi\hbar/l$, that is, at $p = \pm\sqrt{2mE_n}$,
where both the numerator and denominator of (11) vanish, is
$\tfrac12l/h$ and is independent of $n$. Graphs of (11) for $n = 1, 2, 3,$
and 4 are shown in Fig. 39. The distribution (11) is very different





Note that the short-cut argument, "The energy of a boxed particle is
all kinetic, and hence, whenever the result of a precise energy measurement
is certainly $E_n$, the result of a precise momentum measurement is certainly
either $\sqrt{2mE_n}$ or $-\sqrt{2mE_n}$," which, were it correct, would contradict (11),
fails to inquire into the experimental technique to be used in the
measurements.





from the corresponding classical distribution-in-$p$ given in Table
1“ and again in Equation 27“.

The use of (11) in (9) yields the following values for the averages
of the integral powers of $p$:

$$\mathrm{av}_n\,p^k = \begin{cases} 0 & \text{if } k \text{ is odd} \quad (12) \\ 2mE_n & \text{if } k = 2 \quad (13) \\ \infty & \text{otherwise.} \quad (14) \end{cases}$$

The results (12) and (13) agree with those to be expected by
classical analogy in spite of the dissimilarity between the
quantum-mechanical and the classical distributions-in-$p$.
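The normalization of (11) and the result (13) can be checked by direct numerical integration. The Python sketch below is an illustration under assumptions: the dimensionless variable $u = pl/\hbar$ is introduced here, so that (11) per unit $u$ becomes $2n^2\pi(1 - \cos n\pi\cos u)/(n^2\pi^2 - u^2)^2$; the integrals are cut off at a finite `u_max`; and a midpoint rule is used. The names `rho` and `moments` are hypothetical.

```python
import math

def rho(u, n):
    """Distribution (11) per unit u = p*l/hbar, so that it integrates to 1 over u."""
    denom = n * n * math.pi ** 2 - u * u
    if abs(denom) < 1e-6:
        return 1.0 / (4.0 * math.pi)   # limiting value at u = +-n*pi
    sign = -1.0 if n % 2 else 1.0      # cos(n*pi)
    return 2.0 * n * n * math.pi * (1.0 - sign * math.cos(u)) / denom ** 2

def moments(n, u_max=2000.0, step=0.01):
    """Midpoint-rule estimates of the norm and of the average of u**2 for |u| < u_max."""
    count = int(u_max / step)
    norm = 0.0
    second = 0.0
    for i in range(count):
        u = (i + 0.5) * step
        w = rho(u, n)
        norm += w
        second += u * u * w
    # rho is even in u, so the negative half of the axis doubles both sums.
    return 2.0 * norm * step, 2.0 * second * step

norm, second = moments(1)
print(norm, second / math.pi ** 2)  # both close to 1 for n = 1
```

In these units (13) reads $\mathrm{av}_n\,u^2 = n^2\pi^2$. The second moment converges slowly with `u_max` because its integrand falls off only as $u^{-2}$, and the same integrand with $u^4$ in place of $u^2$ does not fall off at all, in line with (14).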


Exercises


im 


1. Show that, for a particle in the box of Fig. 36, av,, x 
— — I, and av» x* = r— — — 


JZ, avn = 
avn Verify that 


for large values of n the two last results approach the corresponding clas- 
sical results. 

2. Obtain (4) and (5) without a limiting process, but by assuming
outright that $\psi$ should vanish outside and at the ends of the box, and that $\psi$
and $\psi'$ should be continuous inside the box.

3. Study (11) for large values of $n$.

4. To verify that the discontinuities in the derivatives of $\psi_n$ at 0 and at $l$
play an important part in determining the distribution-in-$p$, show that,
if the limits of integration in (7) are changed from $-\infty$ and $\infty$ to 0 and $l$,
then, in spite of the vanishing of $\psi_n$ outside the box, we get results in
disagreement with (14); incidentally, the results we then get are those
expected by classical analogy.

5. To elaborate on Exercise 4, draw a curve like the top curve in Fig. 37,
except that the corners at 0 and at $l$ are smoothed out; also draw rough graphs
of several derivatives of this curve. Then allow the bends at 0 and at $l$
to become sharper and sharper, and study graphically the behavior of the
derivatives.

6. A boxed particle is in the $n$th quantum state. Assume that, if the
length of the box is varied slowly, the system will remain in the $n$th
quantum state, so that, in particular, its energy at any instant can be computed
from (5) by letting $l$ be the length of the box at that instant; then show that
the force exerted by the particle on a wall of the box is $2E_n/l$. Note that
this force equals the classical average force.


31. Radiative Transitions 

From our discussion of the stationary states, it would appear
that if an isolated system is once in an energy state it will remain
in that state forever. Experimental evidence regarding the
behavior of atoms and molecules indicates, however, that a
dynamical system in an excited state may spontaneously go over
to a state of lower energy, the transition being accompanied by
emission of energy in the form of radiation. The reason for the
failure of our elementary calculations to provide for spontaneous
transitions is not far to seek: in using the expression
$H = p^2/2m + V(x)$ for the total energy of the system, we have at the
outset excluded the possibility of conversion of mechanical energy
into electromagnetic energy. In the general quantum-mechanical
theory,¹³ the interaction of matter and radiation is allowed for
from the beginning; there we start with the dynamical system¹⁴

atom + radiation, (1)

and the Hamiltonian then consists of terms describing the
mechanical energy of the atom, terms describing the energy of the
radiation, and terms describing the interaction energy of the
atom and the radiation. It then comes about that every energy
value of the system (1) given by the theory can be interpreted as a
possible energy of the atom alone, plus a possible energy of the
radiation alone, plus a small interaction energy, so that it is still
permissible to speak of the energy levels of the atom itself. Now,
the feature of the general theory that is of particular interest to
us at present is this: if we start with the system (1) at $t = 0$ in a
state that can be described roughly as

atom in an excited state II + no radiation, (2)

we find that by a subsequent instant $t$ the system may have
passed over into a state

atom in a state I + radiation, (3)

which has the same total energy as the initial state (2), although
the energy of the atom itself is now smaller. Whether or not
the transition (2) $\to$ (3) will actually occur, or the precise instant
at which it takes place if it does take place, cannot, according to
the theory, be inferred from the information that at $t = 0$ the
system is certainly in the state (2); in other words, an excited

¹³ Dirac, Chapter XI; E. Fermi, Rev. Mod. Phys., 4, 87 (1932); W. Heitler,
The Quantum Theory of Radiation, Oxford, The Clarendon Press, 1936.

¹⁴ We use the word atom for material dynamical system.





atom may 'jump' spontaneously into a state of lower energy and
in doing so emit radiation.

In our elementary discussion, the subject of radiative
transitions is introduced as an afterthought, the formulas for the
calculation of transition probabilities and of the frequency of the
emitted light are taken for granted, and the relation of these
formulas to the fundamental postulates of the general theory is
left unclarified. The calculations we shall perform concern
systems consisting of a particle moving along the $x$-axis and are
therefore only of academic interest; but the mathematical
technique to be employed is a fair prototype of that involved in actual
physical problems. Having as yet made no provisions for dealing
with physical phenomena taking place in three-dimensional space,
we are of course entirely unprepared to discuss the polarization of
the light emitted by actual systems, or the dependence of its
intensity on the direction from which it may be viewed.

Our first assumption is the celebrated Bohr frequency condition:

A. A spontaneous transition of a dynamical system from an
energy state II of energy $E_{II}$ to an energy state I of lower energy $E_I$
is accompanied by the emission of radiation of spectroscopic
frequency $\nu_{II\to I}$, given by the formula

$$\nu_{II\to I} = \frac{1}{h}(E_{II} - E_I), \tag{4}$$

where $h$ is Planck's constant.

The measure of the likelihood of a spontaneous transition from
a state II to a state I is taken to be the constant, denoted by
$A_{II\to I}$, such that $A_{II\to I}\,dt$ is the probability that the transition

S ri ^ infinitesimal interval dt if the system 

“ interval dt; 

II ^ T the probability of the quantum jump 

mZ Sf than itsdf, is a proba- 

theWIn^ connection with transition probabilities, wo introduce 

br ^ 

B n. # his paper inaugurating quantum mechanics : 

fiem an anigy .(ai, 

W. Heisenberg, Zeits. f. Physik, 33, 879 (1925).



FURTHER PROBLEMS 


165 


§31 


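II to an energy state I of lower energy is, to a high degree of
approximation, given by the formula

$$A_{II\to I} = \frac{64\pi^4 e^2\,\nu_{II\to I}^{3}}{3hc^3}\;
\frac{\left|\int_{-\infty}^{\infty} \bar\psi_{II}\,x\,\psi_{I}\,dx\right|^2}
{\int_{-\infty}^{\infty} \bar\psi_{II}\psi_{II}\,dx\,\int_{-\infty}^{\infty} \bar\psi_{I}\psi_{I}\,dx}, \tag{5}$$

where e is the electric charge of the particle, c is the velocity of light,
h is Planck's constant, and $\psi_I$ and $\psi_{II}$ are Schroedinger functions
specifying, respectively, the states I and II of the system, and where
$\nu_{II\to I}$ is given by (4).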

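If $\psi_I$ and $\psi_{II}$ are both normalized to unity, (5) becomes

$$A_{II\to I} = \frac{64\pi^4 e^2\,\nu_{II\to I}^{3}}{3hc^3}\,
\left|\int_{-\infty}^{\infty} \bar\psi_{II}\,x\,\psi_{I}\,dx\right|^2. \tag{6}$$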


The line spectrum of an oscillator. To illustrate the application
of the assumptions A and B, we shall now compute the transition
probabilities for a linear harmonic oscillator consisting of a
particle acted on by a restoring force −kx, and having mass m
and charge e; in view of the factor $e^2$ in (5), a system cannot
radiate unless it carries an electric charge. The normalized
Schroedinger functions for the energy states of the system are

$$\psi_n = N_n H_n(x/a)\,e^{-x^2/2a^2}, \qquad n = 0, 1, 2, \cdots, \tag{7}$$

where $a = (\hbar^2/km)^{1/4}$ and $N_n = (a\,2^n n!\,\pi^{1/2})^{-1/2}$.

The transition probability, $A_{n\to n'}$, for a spontaneous quantum
jump of the oscillator from the nth to the n'th energy state
depends, according to (6), on the integral

$$\int_{-\infty}^{\infty} \bar\psi_n\,x\,\psi_{n'}\,dx, \tag{8}$$


We shall treat (5) as exact; in the general theory, besides terms having
the form (5) and describing the so-called dipole radiation, there appear
terms describing multipole radiation. See E. U. Condon and G. H. Shortley,
The Theory of Atomic Spectra [hereafter cited as Condon and Shortley],
Chapter IV; Cambridge: The University Press, 1935.



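where the $\psi$'s are given by (7). According to 41*, the values of
this integral are

$$\int_{-\infty}^{\infty} \bar\psi_n\,x\,\psi_{n'}\,dx =
\begin{cases}
a\sqrt{n/2} & \text{if } n' = n - 1, \\
a\sqrt{(n+1)/2} & \text{if } n' = n + 1, \\
0 & \text{otherwise.}
\end{cases} \tag{9}$$

The first line of (9), when used in (6), yields

$$A_{n\to n-1} = \frac{32\pi^4 e^2 a^2\,\nu_{n\to n-1}^{3}}{3hc^3}\,n. \tag{10}$$

For an oscillator, $E_n = (n + \tfrac12)h\nu_c$, where $\nu_c$ is the classical
vibration frequency, so that, according to (4),

$$\nu_{n\to n-1} = \frac{1}{h}\left[(n + \tfrac12)h\nu_c - (n - 1 + \tfrac12)h\nu_c\right] = \nu_c, \tag{11}$$

and consequently, since $a^2 = \hbar/(km)^{1/2}$ and $\nu_c = (k/m)^{1/2}/2\pi$,

$$A_{n\to n-1} = \frac{32\pi^4 e^2 a^2 \nu_c^{3}}{3hc^3}\,n = \frac{2e^2k}{3c^3m^2}\,n. \tag{12}$$

As we should expect, $A_{n\to n-1}$ vanishes for n = 0.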

We next consider transitions from the nth state to a state other
than the (n − 1)th. The second line in (9) does not concern
spontaneous transitions, because there n' > n; the third line
shows that $A_{n\to n'}$ vanishes whenever n' < n − 1. Consequently
the only possible direct spontaneous transition that an oscillator
in the nth quantum state can make is into the (n − 1)th level.
The spontaneous transitions of an oscillator are represented
diagrammatically in Fig. 40, which refers to the five lowest quantum
states; the horizontal lines indicate the relative positions of the
energy levels, and the arrows represent possible spontaneous
transitions. Such a diagram summarizes what we call the selection
rules for the transitions; but it does not display the respective
probabilities of the possible transitions.

Fig. 40. Transition diagram for a linear harmonic oscillator.

In an experimental study of the light emitted 
in the spontaneous transitions of an atom, an 
assemblage of similar atoms is confined in a 
chamber and is subjected to the action of a 
suitable exciting agent; and the light emitted 
by those atoms that become excited and then 





make spontaneous 'downward' transitions is analyzed spectroscopically.
The positions of the resulting spectral lines can be
calculated theoretically from the knowledge of the energy levels 
and selection rules; but the intensities, both absolute and rela- 
tive, of the possible lines depend also on the method of excitation, 
and so forth. 

Note that the spectroscopic frequency of the light emitted in 
any spontaneous transition of an oscillator is equal to the classical 
vibration frequency of the oscillator; hence the line spectrum of a 
linear harmonic oscillator consists of a single line of frequency $\nu_c$.
According to our elementary calculations, this line is sharp; the 
so-called natural width of a spectral line (to be distinguished from 
the width due to the Doppler effect and collision effects) is ac- 
counted for by the general theory. 
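
The selection rule contained in (9) can be checked numerically. The following is only a sketch under stated assumptions: it uses the standard normalized oscillator functions of (7) with a = 1, builds the Hermite polynomials from their usual recurrence, and evaluates the integral (8) by the trapezoidal rule. The function names `hermite`, `psi`, and `dipole` are illustrative and do not come from the text.

```python
import math

def hermite(n, y):
    """Physicists' Hermite polynomial H_n(y), from H_{k+1} = 2y H_k - 2k H_{k-1}."""
    h_prev, h = 1.0, 2.0 * y
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, 2.0 * y * h - 2.0 * k * h_prev
    return h

def psi(n, x, a=1.0):
    """Normalized oscillator function N_n H_n(x/a) exp(-x^2 / 2a^2), as in (7)."""
    norm = (a * 2.0 ** n * math.factorial(n) * math.sqrt(math.pi)) ** -0.5
    return norm * hermite(n, x / a) * math.exp(-x * x / (2.0 * a * a))

def dipole(n, k, a=1.0, half_width=12.0, steps=4800):
    """Trapezoidal estimate of the integral (8) of psi_n * x * psi_k over the x-axis."""
    h = 2.0 * half_width * a / steps
    total = 0.0
    for i in range(steps + 1):
        x = -half_width * a + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * psi(n, x, a) * x * psi(k, x, a)
    return total * h

if __name__ == "__main__":
    n = 3
    for k in range(0, 6):
        # Only k = n - 1 and k = n + 1 should give values appreciably different from zero.
        print(n, k, round(dipole(n, k), 6))
```

With a = 1 the two non-vanishing integrals should come out close to the square roots of n/2 and (n + 1)/2, so that the squared integral entering (6) for the jump n → n − 1 is proportional to n, in line with (10).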


Exercises 


1. Let the energy states of a dynamical system having energies smaller
than that of a state II be denoted by I', I'', I''', and so on, and let $P_{II}(t)$
denote the probability that the system will still be in the state II at the
instant t if it is certainly in this state at t = 0; show that
$P_{II}(t) = \exp[-(A_{II\to I'} + A_{II\to I''} + A_{II\to I'''} + \cdots)t]$. The time interval
during which $P_{II}(t)$ decreases by the factor e is called the mean life of
the state II.

2. A linear harmonic oscillator is certainly in the state n = 2 at t = 0;
show that the probability that it will be in the normal state at the time t
is $(1 - e^{-at})^2$, where $a = 2e^2k/3c^3m^2$. Hint: show first that the probability
that the oscillator will be in the normal state at the time t, having jumped
from the state n = 2 to the state n = 1 between the instants $t_1$ and $t_1 + dt_1$,
is $2a\,e^{-2at_1}[1 - e^{-a(t - t_1)}]\,dt_1$.

3. A linear harmonic oscillator is certainly in the nth quantum state at
t = 0; show that the probability that it will be in the normal state at the
time t is $(1 - e^{-at})^n$, where $a = 2e^2k/3c^3m^2$.

4. Show that if $\psi_n$ and $\psi_k$ are any two of the functions 4*®, then

$$\int \bar\psi_n\,x\,\psi_k\,dx =
\begin{cases}
-\dfrac{8l\,nk}{\pi^2 (n^2 - k^2)^2} & \text{if } n + k \text{ is odd,} \\
\dfrac{l}{2} & \text{if } n = k, \\
0 & \text{otherwise.}
\end{cases} \tag{13}$$

5. Use (13) to show that the transition probabilities of a particle of mass
m and charge e in a one-dimensional box of length l are

$$A_{n\to k} =
\begin{cases}
\dfrac{8e^2h^2}{3c^3m^3l^4}\,\dfrac{n^2k^2}{n^2 - k^2} & \text{if } n + k \text{ is odd,} \\
0 & \text{if } n + k \text{ is even,}
\end{cases} \tag{14}$$

and verify the transition diagram for the six lowest levels given in Fig. 41.




“wt luotit levels, and that thm produc- 

6. Convince yourself that a free particle cannot make spontaneous
radiative transitions even if it carries an electric charge, by
regarding the free particle as a limiting case of a particle in a box
and using (5) and some appropriate limiting procedure, say that of
Exercise 5.

32. Properties of the One-Dimensional First
Schroedinger Equation

In the case of a linear harmonic oscillator, a free particle, a
particle in a rectangular potential well, or a particle in a box, the
Schroedinger eigenfunctions of H can, as we have seen, be computed
readily. There are a few other special cases for which the
eigenfunctions of H have been computed explicitly; but in general
the solutions of the Schroedinger equation cannot be expressed in
terms of the elementary functions. Fortunately, important
properties of the solutions can be inferred without solving the
equation explicitly; and it is the purpose of this section to
enumerate several of these and to provide hints concerning the more
involved proofs.

We assume that the first Schroedinger equation for one-dimensional
motion, namely, $\psi'' + \kappa[E - V(x)]\psi = 0$, has been transformed into

$$u'' + [\lambda - g(x)]u = 0, \tag{1}$$

where the new independent variable x, the function g(x), and the
parameter $\lambda$ are all free from physical dimensions; V(x), and
consequently g(x), and all their derivatives will be assumed to be
continuous. The symbol u will denote (a) particular solutions of
(1) that are not identically zero, but that are not necessarily
well-behaved; (b) particular real solutions of (1) that are not
identically zero, but that are not necessarily well-behaved; and (c) the
general solution of (1). It should be clear from the context which
use of the symbol u is implied. By eigenfunctions and eigenvalues
of equation (1) we shall mean the eigenfunctions and eigenvalues
of the operator $-d^2/dx^2 + g(x)$, that is, well-behaved solutions
of (1) and values of $\lambda$ for which such solutions exist.

Much of the discussion of this section is based on the work of W. E.
Milne, Trans. Amer. Math. Soc., 30, 797 (1928), and Phys. Rev., 35, 863 (1930).

See, for example, the transformation from P* to 4*».

Fig. 41. Transition diagram for a particle in a one-dimensional box.
Several of the theorems that we proved in Exercise 4^ and else- 
where are special cases of those taken up here. 

1. If a solution of (1) and its first derivative both vanish at a
finite value of x, then this solution is identically zero; both u and
u' may, however, approach zero when $x \to \pm\infty$, even if u is not
identically zero.

2. If $u(x_0) \neq 0$ and $\lambda > g(x_0)$, then $u''(x_0)/u(x_0)$ is negative,
and hence u is concave toward the x-axis at $x = x_0$; similarly,
if $u(x_0) \neq 0$ and $\lambda < g(x_0)$, then $u''(x_0)/u(x_0)$ is positive, and
hence u is convex toward the x-axis at $x = x_0$.

3. If $u(x_0) = 0$ and $\lambda \neq g(x_0)$, then $u''(x_0) = 0$ and $u'''(x_0) \neq 0$,
so that u inflects at $x = x_0$.

4. If $u(x_0) \neq 0$, $\lambda = g(x_0)$, and $g'(x_0) \neq 0$, then $u''(x_0) = 0$
and $u'''(x_0) \neq 0$, so that u inflects at $x = x_0$.

5. If u' vanishes at $x_1$ and at $x_2$, and if $\lambda > g(x)$ everywhere
between $x_1$ and $x_2$, then u vanishes at least once between $x_1$
and $x_2$.

{Integrate (1) from $x_1$ to $x_2$, getting
$u'(x_2) - u'(x_1) + \int_{x_1}^{x_2} [\lambda - g]u\,dx = 0$, that is,
$\int_{x_1}^{x_2} [\lambda - g]u\,dx = 0$, so that, if $[\lambda - g]$ does not change sign
between $x_1$ and $x_2$, then u must do so.}

6. The function uu' can, at most, vanish once in a region
throughout which $\lambda < g(x)$; for example, u cannot have two
nodes, or a maximum and a node in such a region.

{If uu' vanishes at $x_1$ and $x_2$, then, by Rolle's theorem, $d(uu')/dx$ must
vanish at least once between $x_1$ and $x_2$; now, $d(uu')/dx = u'u' + uu'' =
(u')^2 + [g - \lambda]u^2$, and hence $d(uu')/dx > 0$ wherever $\lambda < g$.}

7. If $u_1$ and $u_2$ are two particular solutions of (1) then

$$u_1u_2' - u_2u_1' = c, \tag{2}$$

where c is a constant.

{Differentiate (2) and eliminate $u_1''$ and $u_2''$ by means of (1).}

8. The condition c = 0 in (2) is necessary and sufficient in
order that $u_1$ and $u_2$ be linearly dependent.

{Necessity: if $u_1$ and $u_2$ are linearly dependent (so that $u_2$ = const. × $u_1$),
then the left side of (2) vanishes. Sufficiency: set c = 0 in (2), getting
$u_2'/u_2 = u_1'/u_1$, and integrate, getting $\log u_2 = \log u_1$ + constant, that is,
$u_2$ = constant × $u_1$.}

9. Two linearly independent solutions of (1) cannot vanish at
the same value of x.

10. If $u_1$ is a particular solution of (1), then $u_2 = u_1 \int_{x_0}^{x} u_1^{-2}\,dx$,
where $x_0$ is an arbitrarily fixed value of x, is another particular
solution of (1); further, $u_1$ and $u_2$ are linearly independent, so that

$$u = Au_1 + Bu_1 \int_{x_0}^{x} u_1^{-2}\,dx, \tag{3}$$

where A and B are arbitrary constants, is the general solution
of (1).

{$u_2' = u_1' \int_{x_0}^{x} u_1^{-2}\,dx + u_1^{-1}$; $u_2'' = u_1'' \int_{x_0}^{x} u_1^{-2}\,dx + u_1'u_1^{-2} - u_1^{-2}u_1'
= u_1'' \int_{x_0}^{x} u_1^{-2}\,dx = [g - \lambda]u_1 \int_{x_0}^{x} u_1^{-2}\,dx = [g - \lambda]u_2$; hence
$u_2$ satisfies (1). Substitution of $u_1$ and $u_2$ into (2) yields c = 1, and hence
(theorem 8) $u_1$ and $u_2$ are linearly independent.}

11. If $u_1$ and $u_2$ are the two particular solutions of (1) that
satisfy the conditions

$$u_1(0) = 1, \quad u_1'(0) = 0, \quad u_2(0) = 0, \quad u_2'(0) = 1, \tag{4}$$

and v is the function

$$v(x) = \sqrt{u_1^2 + u_2^2}, \tag{5}$$

then

$$v(0) = 1, \quad v'(0) = 0 \tag{6}$$

and

$$v(x) > 0, \tag{7}$$

so that v is a nodeless function of x.

{The equation

$$u_1u_2' - u_2u_1' = 1 \tag{8}$$

holds for x = 0 and hence (theorem 7) it holds for all values of x;
consequently $u_1$ and $u_2$ are linearly independent (theorem 8), and (7) follows
from theorem 9. Fig. 42, computed numerically, shows v for $\lambda$ = 1, 2, and
3 when $g(x) = x^2$.}

It is quite immaterial for our purposes whether the explicit analytic
expressions for $u_1$ and $u_2$ are available or not.



12. If $u_1$ and $u_2$ are the particular solutions (4), and $\phi$ is the
function

$$\phi(x) = \tan^{-1}(u_2/u_1), \tag{9}$$

then

$$\phi(0) = 0 \tag{10}$$

and

$$\phi'(x) = v^{-2}, \tag{11}$$

so that $\phi$ is an increasing function of x.

{$\phi' = d(\tan^{-1} u_2/u_1)/dx = (u_1u_2' - u_2u_1')/(u_1^2 + u_2^2)$; (11) then follows
from (8) and (5). Graphs of $\phi$ for $\lambda$ = 1, 2, and 3 when $g(x) = x^2$ are shown
in Fig. 43, computed numerically.}



13. The general solution of (1) can be written in the form

$$u = Cv(x) \sin[\phi(x) - \theta], \tag{12}$$

where C and $\theta$ are arbitrary constants; it is then the factor
$\sin[\phi - \theta]$ that is responsible for the nodes of u.

{Since $u_1$ and $u_2$ satisfying (4) are linearly independent (theorem 8),
the general solution of (1) is $u = Au_1 + Bu_2$. Using (5) and (9), we get
$Au_1 + Bu_2 = (u_1^2 + u_2^2)^{1/2}[Au_1/(u_1^2 + u_2^2)^{1/2} + Bu_2/(u_1^2 + u_2^2)^{1/2}] = v[A\cos\phi +
B\sin\phi] = (A^2 + B^2)^{1/2}\,v\{A\cos\phi/(A^2 + B^2)^{1/2} + B\sin\phi/(A^2 + B^2)^{1/2}\}$; and,
introducing the constants $C^2 = A^2 + B^2$ and $\theta = -\tan^{-1} A/B$, we get (12).}


14. The function v(x) is the particular solution of the equation

$$v'' + [\lambda - g(x)]v = v^{-3} \tag{13}$$

that satisfies the conditions

$$v(0) = 1, \quad v'(0) = 0. \tag{14}$$

{To derive (13), substitute (12) into (1), noting that, in view of (11),
$u'' = C(v'' - v^{-3}) \sin[\phi - \theta] = (v'' - v^{-3})u/v$.}
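
Theorems 11 and 14 lend themselves to a numerical cross-check. The sketch below is an illustration only, not the method used to compute Fig. 42: it assumes $g(x) = x^2$, integrates (1) with the initial conditions (4) by the classical Runge-Kutta scheme, forms v from (5), and compares it with v obtained by integrating (13) directly with the conditions (14). The helper names `rk4`, `fundamental_pair`, and `milne_v`, and the trial values of $\lambda$ and x, are assumptions introduced here.

```python
import math

def rk4(f, y, x0, x1, steps):
    """Classical fourth-order Runge-Kutta for y' = f(x, y); y is a list of floats."""
    h = (x1 - x0) / steps
    x = x0
    for _ in range(steps):
        k1 = f(x, y)
        k2 = f(x + h / 2, [yi + h / 2 * ki for yi, ki in zip(y, k1)])
        k3 = f(x + h / 2, [yi + h / 2 * ki for yi, ki in zip(y, k2)])
        k4 = f(x + h, [yi + h * ki for yi, ki in zip(y, k3)])
        y = [yi + h / 6 * (a + 2 * b + 2 * c + d)
             for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]
        x += h
    return y

def g(x):
    return x * x  # assumed example: the g(x) used for Figs. 42 and 43

def fundamental_pair(lam, x_end, steps=3000):
    """u1, u1', u2, u2' at x_end for the solutions of (1) satisfying (4)."""
    def f(x, y):
        u1, du1, u2, du2 = y
        return [du1, (g(x) - lam) * u1, du2, (g(x) - lam) * u2]
    return rk4(f, [1.0, 0.0, 0.0, 1.0], 0.0, x_end, steps)

def milne_v(lam, x_end, steps=3000):
    """v(x_end) from equation (13) with the initial conditions (14)."""
    def f(x, y):
        v, dv = y
        return [dv, v ** -3 - (lam - g(x)) * v]
    return rk4(f, [1.0, 0.0], 0.0, x_end, steps)[0]

lam, x_end = 2.0, 1.5
u1, du1, u2, du2 = fundamental_pair(lam, x_end)
v_from_pair = math.sqrt(u1 * u1 + u2 * u2)   # definition (5)
wronskian = u1 * du2 - u2 * du1              # should stay equal to 1, as in (8)
v_from_milne = milne_v(lam, x_end)

if __name__ == "__main__":
    print(wronskian, v_from_pair, v_from_milne)
```

The two values of v should agree to within the integration error, and the combination in (8) should remain equal to 1.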


15. If

$$u'' + [\lambda - g(x)]u = 0, \tag{15}$$

and

$$U'' + [\lambda + \epsilon - g(x)]U = 0, \tag{16}$$


thm whe, , ,nd X in (1) m glvet, 



where $\epsilon$ is a positive constant, then U has a node between every
two nodes of u; that is, the nodes of u are separated by nodes of
U. This theorem is a special case of Sturm's first comparison
theorem.

{Multiply (15) by U and (16) by u; subtract, getting $Uu'' - uU'' = \epsilon uU$,
that is, $d(u'U - uU')/dx = \epsilon uU$; integrate this between two successive
nodes $x_1$ and $x_2$ of u, getting $u'(x_2)U(x_2) - u(x_2)U'(x_2) - u'(x_1)U(x_1) +
u(x_1)U'(x_1) = \epsilon \int_{x_1}^{x_2} uU\,dx$, that is,

$$u'(x_2)U(x_2) - u'(x_1)U(x_1) = \epsilon \int_{x_1}^{x_2} uU\,dx. \tag{17}$$

Suppose now that u is positive between $x_1$ and $x_2$ and that U is positive at
and between $x_1$ and $x_2$; then (Fig. 44) the left side of (17) is negative, while
the right side of (17) is positive, and we have an inconsistency. We may
change the sign of u, of U, or of both, and may allow U to have a
node at $x_1$, or a node at $x_2$, or nodes at both $x_1$ and $x_2$; but the
inconsistency persists until we allow U to have a node between $x_1$ and $x_2$.}

Fig. 44.



The bound case. If $V(x) \to \infty$ when $|x| \to \infty$, then the
classical motion of a particle of given total energy does not
extend outside certain values of x, and we call the motion bound.
If $x_1$ and $x_2$ are, respectively, the smallest and the greatest roots
of the equation $g(x) = \lambda$ in a bound case (Fig. 45), then we use
the terms left outer, inner, and right outer region to denote,
respectively, the region at the left of $x_1$, that between $x_1$ and
$x_2$, and that at the right of $x_2$.

16. In the bound case, u has at most a finite number of nodes.

{It is perhaps obvious that u has a finite number of nodes (if any) in the
finite inner region; and it has at most one node in each of the two outer
regions (theorem 6).}

E. L. Ince, Ordinary Differential Equations [hereafter cited as Ince],
page 228.



17. In the bound case, $\phi$ satisfies the conditions

$$\lim_{x \to -\infty} \phi(x) = \phi_1, \qquad \lim_{x \to \infty} \phi(x) = \phi_2, \tag{18}$$

where the numbers $\phi_1$ and $\phi_2$ are finite.

{Were $\phi$ to decrease without limit when $x \to -\infty$, or to increase without
limit when $x \to \infty$, the sine factor in (12) would cause u to have infinitely
many nodes, in contradiction to theorem 16. Fig. 43 provides an illustration
of the conditions (18).}

18. In the bound case, $\phi' \to 0$ when $|x| \to \infty$.

{Otherwise Equations (18) cannot hold, $\phi'$ never being negative.}

19. In the bound case, $v \to \infty$ and $|v'| \to \infty$ when $|x| \to \infty$.

{That $v \to \infty$ when $|x| \to \infty$ follows from (11) and theorem 18; it then
follows from (13) that $v'' \to \infty$ when $|x| \to \infty$, and that hence $|v'| \to \infty$
when $|x| \to \infty$.}

20. In the bound case, Equation (1) possesses (a) solutions
that approach zero when $x \to -\infty$; (b) solutions that approach
$\pm\infty$ when $x \to -\infty$; (c) solutions that approach zero when
$x \to \infty$, and (d) solutions that approach $\pm\infty$ when $x \to \infty$.

{Solutions of type (a) are

$$Av(x) \sin[\phi(x) - \phi_1], \tag{19}$$

where A is an arbitrary constant. When $x \to -\infty$, $v \to \infty$ and
$\sin[\phi - \phi_1] \to 0$; to resolve the indeterminacy, we rewrite (19) as
$A \sin[\phi - \phi_1]/v^{-1}$ and replace this by the quotient of the derivatives
of the numerator and denominator, that is, by
$A \cos[\phi - \phi_1]\,\phi'/(-v^{-2}v') = -A \cos[\phi - \phi_1]/v'$;
the last fraction tends to zero when $x \to -\infty$ (theorem 19), and hence (19)
does also.

Several solutions of (1) having the form (19) are shown in Fig. 46; the
curves shown have the same A. The point at which a curve enters the right
outer region is marked by a crossline. Where the curves lie too close
together, they are not drawn separately. Note that as $\lambda$ increases
the nodes (except that at $-\infty$) move to the left, as follows from
theorem 15, and that after the rightmost node has moved into the inner
region a new node forms at $+\infty$. Note also that $\lambda = 1$ and
$\lambda = 3$ are eigenvalues of Equation (1) when $g = x^2$.



Solutions of type (c) are

$$Cv(x) \sin[\phi(x) - \phi_2]; \tag{20}$$

those of type (b) are linear combinations of $Bv(x) \cos[\phi(x) - \phi_1]$ and
(19); those of type (d) are linear combinations of $Dv(x) \cos[\phi(x) - \phi_2]$
and (20).}


21. If u remains finite in an outer region, then it approaches
zero when x recedes outward in this region.

{The solutions $v \sin[\phi - \phi_2]$ and $v \cos[\phi - \phi_2]$ are linearly independent
(theorem 8), so that the general solution of (1) can be written as

$$u = Cv(x) \sin[\phi(x) - \phi_2] + Dv(x) \cos[\phi(x) - \phi_2], \tag{21}$$

where C and D are arbitrary constants. When $x \to \infty$, the first term in
(21) approaches zero and the second term increases without limit (theorem
20); hence u remains finite when $x \to \infty$ only if D = 0; but in this case u
approaches zero when $x \to \infty$. Substituting $\phi_1$ for $\phi_2$ in (21), we get the
general solution in the form suitable for proving the theorem for the left
outer region.

A graphical argument runs as follows: Let $u(x_0) > 0$ (Fig. 47), where $x_0$
lies in the right outer region, so that u is convex toward the x-axis
everywhere at the right of $x_0$ (theorem 2). Now (a) if $u'(x_0) > 0$, then $u \to \infty$
when $x \to \infty$. If $u'(x_0) < 0$, there are three possibilities: (b) when $x \to \infty$,
u remains positive, has a minimum at the right of $x_0$, and then goes to $\infty$;
(c) u approaches the x-axis when $x \to \infty$; (d) u crosses the x-axis at the
right of $x_0$, inflects at the crossing (theorem 3), and then goes to $-\infty$.
Thus u remains finite only in the case (c). Similar arguments apply when
$x_0$ is in the right outer region and $u(x_0) < 0$, and when $x_0$ is in the left
outer region.}

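22. In the bound case, an eigenfunction of Equation (1) vanishes when
$|x| \to \infty$.

23. The quantity $\phi_2 - \phi_1$ is an increasing function of $\lambda$ and tends to
$\infty$ when $\lambda \to \infty$.

{Differentiate (1) partially with respect to $\lambda$, multiply the result by u,
and substitute $-u''$ for $[\lambda - g]u$, getting
$u''\,\partial u/\partial\lambda - u\,\partial u''/\partial\lambda = u^2$, that is,
$d(u'\,\partial u/\partial\lambda - u\,\partial u'/\partial\lambda)/dx = u^2$; substitute (12)
(with C = 1) into this, getting

$$\frac{d}{dx}\left\{\left(v'\frac{\partial v}{\partial\lambda} - v\frac{\partial v'}{\partial\lambda}\right)\sin^2[\phi - \theta]
+ \frac{\partial\phi}{\partial\lambda} + v^{-1}\frac{\partial v}{\partial\lambda}\sin 2[\phi - \theta]\right\}
= v^2 \sin^2[\phi - \theta]. \tag{22}$$

Choose some value of x, say $x_0$, and set $\theta = \phi(x_0)$; the expression in { }
in (22), which vanishes at x = 0 in view of (6) and (10), then becomes
$\partial\phi(x_0)/\partial\lambda$ at $x = x_0$, so that, integrating (22) from 0 to $x_0$, we get

$$\frac{\partial\phi(x_0)}{\partial\lambda} = \int_0^{x_0} v^2 \sin^2[\phi(x) - \phi(x_0)]\,dx. \tag{23}$$

The integrand in (23) is never negative, so that the algebraic sign of the
integral is the same as that of $x_0$; hence, when $\lambda$ increases, $\phi$ increases
for positive and decreases for negative values of x, as illustrated in Fig. 43.}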

24. In order that $\lambda$ be an eigenvalue of Equation (1) in the
bound case, it is necessary and sufficient that

$$\phi_2 - \phi_1 = n\pi, \qquad n = 1, 2, 3, \cdots. \tag{24}$$

{Sufficiency: if (24) holds, then $v \sin[\phi - \phi_1] = v \sin[\phi + n\pi - \phi_2] =
\pm v \sin[\phi - \phi_2]$, so that a solution of type (19) is also of type (20), and
consequently approaches zero both when $x \to -\infty$ and when $x \to \infty$.
Necessity: if u is an eigenfunction, then $u \to 0$ when $|x| \to \infty$ (theorem 22),
so that both $\phi_1 - \theta$ and $\phi_2 - \theta$ are integral multiples of $\pi$. The condition
(24) is illustrated by the graphs of $\phi$ for $\lambda = 1$ and $\lambda = 3$ in Fig. 43.}

Fig. 47. Possible forms of u in the right outer region.



25. In the bound case, Equation (1) possesses a discrete and
nondegenerate eigenvalue spectrum of the form

$$\lambda_1, \lambda_2, \cdots, \lambda_n, \cdots \tag{25}$$

where $\lambda_1 < \lambda_2 < \cdots$, and where $\lambda_1$ is greater than the smallest
value of g(x).

{An eigenfunction of (1) vanishes when $x \to -\infty$ (theorem 22) and
hence has the form (19). If $\lambda$ is smaller than the smallest value of g, then
(19) has no nodes besides that at $-\infty$ (theorem 6); hence $\phi_2 - \phi_1 < \pi$,
so that (24) does not hold, and $\lambda$ is not an eigenvalue. As $\lambda$ increases, the
quantity $\phi_2 - \phi_1$ increases also (theorem 23), and we have an eigenvalue
whenever (24) holds. Were an eigenvalue degenerate, the corresponding
general solution of (1) would be well-behaved; but this is impossible in
view of the existence of solutions of types (b) and (d) of theorem 20.}

26. In the bound case, an eigenfunction $u_n$ of Equation (1)
belonging to the eigenvalue $\lambda_n$ has n + 1 nodes: one at $-\infty$,
one at $\infty$, and n − 1 in the inner region.

27. In the bound case, every eigenfunction of Equation (1) is
quadratically integrable in the infinite region.

28. In the bound case, eigenfunctions of Equation (1) belonging
to distinct eigenvalues are orthogonal.

{Let $u_n$ and $u_m$ be eigenfunctions of (1) belonging to eigenvalues $\lambda_n$
and $\lambda_m$, so that $u_n'' + [\lambda_n - g]u_n = 0$ and $u_m'' + [\lambda_m - g]u_m = 0$. Multiply
the first equation by $u_m$, the second by $u_n$, and subtract, getting
$u_mu_n'' - u_nu_m'' = (\lambda_m - \lambda_n)u_nu_m$, that is, $d(u_mu_n' - u_nu_m')/dx =
(\lambda_m - \lambda_n)u_nu_m$. Integration from $-\infty$ to $\infty$ yields $0 = (\lambda_m - \lambda_n) \int u_nu_m\,dx$,
so that $\int u_nu_m\,dx = 0$ whenever $\lambda_n \neq \lambda_m$. The case of complex u's is taken
up in Exercise 3.}

29. If g(x) is even in x in the bound case, then every eigenfunction
of Equation (1) belonging to the eigenvalue $\lambda_n$ is even
in x if n is odd, and odd in x if n is even.

{Write −x for x in (1), getting $d^2u(-x)/d(-x)^2 + [\lambda - g(-x)]u(-x) = 0$;
since $d^2/d(-x)^2 = d^2/dx^2$ and since in the case under consideration
$g(-x) = g(x)$, this equation becomes $d^2u(-x)/dx^2 + [\lambda - g(x)]u(-x) = 0$,
so that u(−x) satisfies (1) when u(x) does so. If u(x) is well-behaved,
then u(−x) is also well-behaved, so that if $u_n(x)$ is an eigenfunction of (1)
belonging to the eigenvalue $\lambda_n$, then $u_n(-x)$ is also an eigenfunction of
(1) belonging to the same eigenvalue; since the spectrum is nondegenerate
(theorem 25), it then follows that $u_n(-x) = cu_n(x)$, where c is a constant.
Writing −x for x in the last equation, we get $u_n(x) = cu_n(-x)$,
and hence $u_n(x) = c^2u_n(x)$; thus $c^2 = 1$, and $c = \pm 1$, so that,
when g(x) is even in x, then every eigenfunction of (1) is either even or
odd in x. The correlation between n and the oddness or evenness of $u_n$
follows from theorem 26 and the fact that an odd function has an odd
number of nodes, while an even function has an even number of nodes,
zero being counted as an even number.}

The labeling is that of (25).

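30. In order that $\lambda$ be an eigenvalue of Equation (1) in the
bound case, it is necessary and sufficient that

$$\int_{-\infty}^{\infty} v^{-2}\,dx = n\pi, \qquad n = 1, 2, 3, \cdots. \tag{26}$$

{According to (11) and (10), $\phi = \int_0^x v^{-2}\,dx$; hence $\phi_2 - \phi_1 = \int_{-\infty}^{\infty} v^{-2}\,dx$,
and (24) is equivalent to (26).}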
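
The condition (26) can be tried out numerically. The sketch below is an illustration under stated assumptions: it takes $g(x) = x^2$ (so that, g being even, the integral in (26) is twice the integral from 0 outward), replaces the infinite upper limit by a cutoff `x_max`, and integrates (1) together with $\phi' = v^{-2}$ by the classical Runge-Kutta scheme. The function name `phase_span` and the cutoff and step values are choices made here for illustration.

```python
import math

def phase_span(lam, x_max=6.0, steps=6000, g=lambda x: x * x):
    """Estimate phi_2 - phi_1, the integral of v**-2 over the whole x-axis.

    Assumes g is even, so the integral is twice that from 0 to x_max;
    x_max stands in for infinity (the integrand decays very rapidly).
    """
    h = x_max / steps

    def f(x, y):
        u1, du1, u2, du2, phi = y
        c = g(x) - lam
        # u1 and u2 obey (1); phi obeys (11) with v**2 = u1**2 + u2**2.
        return (du1, c * u1, du2, c * u2, 1.0 / (u1 * u1 + u2 * u2))

    y = (1.0, 0.0, 0.0, 1.0, 0.0)  # initial conditions (4) and (10)
    x = 0.0
    for _ in range(steps):
        k1 = f(x, y)
        k2 = f(x + h / 2, tuple(a + h / 2 * b for a, b in zip(y, k1)))
        k3 = f(x + h / 2, tuple(a + h / 2 * b for a, b in zip(y, k2)))
        k4 = f(x + h, tuple(a + h * b for a, b in zip(y, k3)))
        y = tuple(a + h / 6 * (p + 2 * q + 2 * r + s)
                  for a, p, q, r, s in zip(y, k1, k2, k3, k4))
        x += h
    return 2.0 * y[4]

if __name__ == "__main__":
    for lam in (1.0, 2.0, 3.0):
        print(lam, phase_span(lam) / math.pi)
```

For $g = x^2$ the values $\lambda = 1$ and $\lambda = 3$ should give ratios very close to 1 and 2, while $\lambda = 2$, which is not an eigenvalue, should give a ratio between them, in keeping with theorem 23.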


Conditionally bound cases. Certain general characteristics of
the eigenvalue spectrum of the Schroedinger equation can be
inferred also for cases other than the bound one. For example,
if, as shown in Fig. 48 (a), V(x) has a minimum, increases without
limit when $x \to -\infty$, and approaches a constant when $x \to \infty$,
then the spectrum is discrete and nondegenerate in the region A,
and continuous and nondegenerate in region B. Again, if, as
shown in Fig. 48 (b), V(x) has a minimum, and approaches one
constant when $x \to -\infty$ and another when $x \to \infty$, then the
spectrum is discrete and nondegenerate in region A, continuous
and nondegenerate in region B, and continuous and doubly
degenerate in region C.

Exercises 

1. Taking g(x) and $\lambda$ from Fig. 45, draw the graph of a function that
violates each of the theorems 1, 2, 3, 4, 5, 6, and 21.

2. Show that in the bound one-dimensional case every instantaneous
eigenfunction $\psi$ of the Schroedinger equation has the form $\psi = CR(x)$,
where R(x) is a real function of x, and C is a constant, real, imaginary,
or complex.

3. Prove theorem 28 when the eigenfunctions are allowed to be complex.

4. Show that when V(x) is as in Fig. 48 (a) then (1) possesses, for every
real value of $\lambda$, solutions of types (a) and (b) of theorem 20.

5. Show that if V(x) is as in Fig. 48 (a) or in Fig. 48 (b) and if $\lambda$ is less
than $V_0$, then (1) possesses solutions of types (a), (b), (c), and (d) of
theorem 20.

6. Justify the classifications of spectra given in Fig. 48.

7. Let $u_1$ and $u_2$ be solutions of (1) such that $u_1(x_0) = 1$, $u_1'(x_0) = 0$,
$u_2(x_0) = 0$, $u_2'(x_0) = a > 0$, and let $w(x) = (u_1^2 + u_2^2)^{1/2}$; then show that w
is the particular solution of the equation $w'' + [\lambda - g(x)]w = a^2w^{-3}$ that
satisfies the conditions $w(x_0) = 1$ and $w'(x_0) = 0$; that the general solution
of (1) is $u = Cw \sin\left[a \int_{x_0}^{x} w^{-2}\,dx - \theta\right]$, where C and $\theta$ are arbitrary
constants; and that in the bound case the condition $a \int_{-\infty}^{\infty} w^{-2}\,dx = n\pi$, where
n = 1, 2, 3, ···, is necessary and sufficient in order that $\lambda$ be an eigenvalue
of (1).

8. Examine (3) for the case when a node of $u_1$ lies between $x_0$ and x.

9. Set g(x) = 0 in (1) and compute analytically the function v defined
by (5) and the function $\phi$ defined by (9). Contrast the results with those
shown in Fig. 42 and Fig. 43, referring to a bound case.

10. Set g(x) = 0 in (1) and compute analytically the function w defined
in Exercise 7. Note the simplification that results when a is taken to
be $\lambda^{1/2}$.


Chapter V 


APPROXIMATE METHODS FOR TREATING THE ONE- 
DIMENSIONAL SCHROEDINGER EQUATION 

33. Numerical Approximations

The Schroedinger equation can be solved explicitly in terms of
known functions for only a few special forms of V(x), so that in
many problems approximate methods must be employed. A
number of such methods, adapted to various types of situations,
have been developed, and in this chapter we shall illustrate a few
of them. We shall be concerned with instantaneous $\psi$-functions,
so that in dealing with bound motion we may, according to the
result of Exercise 2 of §32, restrict ourselves without loss of generality
to real solutions of the Schroedinger equation.

We begin with two numerical methods, which we shall use to
compute the lowest eigenvalue and the corresponding $\psi$-function
for the Schroedinger equation

r + ^ = 0 , ( 1 ) 

which concerns the motion of a particle in one dimension under
the action of a restoring force proportional to the cube of the
displacement from x = 0. Introducing the parameter

X = k^JT^E (2) 

and the new independent variable 

(3) 

which we denote (risking an ambiguity) by x, and writing u for
$\psi$, we transform (1) into

$$u'' + (\lambda - x^4)u = 0, \tag{4}$$

an equation of type (1) of §32 with $g(x) = x^4$. In view of theorems 22,
25, and 29 of §32, the spectrum of (4) is discrete and nondegenerate,
the eigenfunctions vanish when $|x| \to \infty$, and $u_1$ (the
eigenfunction belonging to the lowest eigenvalue $\lambda_1$) is even in x,
so that we may restrict ourselves to the positive half of the x-axis.



181 


APPROXIMATE COMPUTATIONS 


§3f 


The methods of this section employ numerical integration of
differential equations. Given an ordinary differential equation
involving no literal symbols besides the independent and
dependent variables and the derivatives of the latter, and given the
numerical initial conditions that a particular solution must satisfy
at some point $x = x_0$, it is in general possible to calculate
numerically the values of this particular solution step by step for
points other than $x_0$ and to extend the solution through any
desired finite interval of x. Several ways of doing this are
available,¹ but it would take us too far afield to describe any of them
here. The choice of a particular process of numerical integration
depends on the type of equation to be handled, on the mechanical
computing devices available, and on the computer's judgment.

The method of two-way integration. Let us assume that
$\lambda = \lambda_1$ in (4), $\lambda_1$ being precisely the lowest eigenvalue of (4).
Further, let $x = x_1$ be some positive value of x, let $u_\alpha$ be the
solution of (4) such that $u_\alpha(0) = 1$ and $u_\alpha'(0) = 0$, and let $u_\beta$
be the solution of (4) such that $u_\beta(x_1) = u_\alpha(x_1)$ and $u_\beta(x) \to 0$
when $x \to \infty$. It is perhaps obvious that then $u_\alpha$ and $u_\beta$ match
properly at $x_1$, that is, that in addition to the equation $u_\beta(x_1) =
u_\alpha(x_1)$ we have $u_\beta'(x_1) = u_\alpha'(x_1)$. The method of two-way integration,
about to be illustrated, enables us to approximate $\lambda_1$ by
studying the departure from proper matching of solutions $u_\alpha$ and
$u_\beta$ computed for trial values of $\lambda_1$.

First trial value of $\lambda_1$. It is necessary first to make at least
a rough estimate of $\lambda_1$. In practice, such estimates can be made
by reference to experimental data, by comparing the equation in
question with similar equations whose properties are known, or
by other methods; in treating (4), we shall rely on an approximate
analytical formula to be discussed in §34, which estimates the
two lowest eigenvalues of (1) as .87 and 3.77, and
shall accordingly adopt

$$\lambda_1' = .87 \tag{5}$$

as our first approximation to the correct value of $\lambda_1$ in (4).

1 See, for example, A. A. Bennett, W. E. Milne, and H. Bateman,
Numerical Integration of Differential Equations, Bulletin No. 92 of the National
Research Council, Washington, D. C., 1933; or J. B. Scarborough,
Numerical Mathematical Analysis, Baltimore, The Johns Hopkins Press, 1930.





First rightward integration. We put (5) into (4), getting

$$u'' + (.87 - x^4)u = 0, \tag{6}$$

and calculate numerically that particular solution $u_a$ of (6) which
equals 1 at x = 0 and has a horizontal tangent there, the computation
extending to $x_1 = 1.2$, say. The resulting $u_a$ is shown in Fig. 49; the
terminal values of $u_a$ and $u_a'$ are given in Table 3, where the initial
conditions are printed in boldface.

Fig. 49. Solutions of (4) for trial values of $\lambda$.

First leftward integration. Our next task is to compute that
solution of (6) which equals .504 at $x_1 = 1.2$ and vanishes when
$x \to \infty$. It is impossible to do this without using some analytic
approximation,² and hence we compute instead the solution that
equals .504 at x = 1.2 and has a horizontal tangent at a sufficiently
large value of x, say at $x_2 = 3.0$. To begin with, we compute, working
from right to left, the solution $u_b$ that equals .001 at $x_2 = 3.0$ and
has a horizontal tangent there; this solution will turn out to have a
wrong value at $x_1 = 1.2$, but a correction for this can be made
later by multiplying $u_b$ by a constant. A portion of the graph
of $u_b$, computed by numerical integration, is shown in Fig.
49; the terminal values of $u_b$ and $u_b'$ are given in Table 4, the
initial conditions being again printed in boldface.

Table 4: $\lambda_1'$ = .87

   x       u_b       u_b'
  1.2     4.010    -7.636
  3.0      .001      .000

2 It is usually expedient to combine the purely numerical work with an
analytical study of the behavior of the solutions for large values of |x|.
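
The two integrations just described can be imitated with any standard step-by-step scheme. The following sketch uses the classical fourth-order Runge-Kutta method, which is a substitute chosen here for illustration and not necessarily the process used in preparing the tables; the function name `integrate` and the number of steps are likewise assumptions.

```python
def integrate(lam, x0, x1, u, du, steps=2000):
    """Runge-Kutta integration of u'' = (x**4 - lam) * u from x0 to x1.

    Returns u(x1) and u'(x1). A negative step (x1 < x0) integrates leftward.
    """
    h = (x1 - x0) / steps
    x = x0
    for _ in range(steps):
        k1u, k1v = du, (x ** 4 - lam) * u
        k2u, k2v = du + h / 2 * k1v, ((x + h / 2) ** 4 - lam) * (u + h / 2 * k1u)
        k3u, k3v = du + h / 2 * k2v, ((x + h / 2) ** 4 - lam) * (u + h / 2 * k2u)
        k4u, k4v = du + h * k3v, ((x + h) ** 4 - lam) * (u + h * k3u)
        u += h / 6 * (k1u + 2 * k2u + 2 * k3u + k4u)
        du += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        x += h
    return u, du

# Rightward: u_a(0) = 1, u_a'(0) = 0, carried to x1 = 1.2 with the trial value .87.
ua, dua = integrate(0.87, 0.0, 1.2, 1.0, 0.0)

# Leftward: u_b(3.0) = .001, u_b'(3.0) = 0, carried back to x1 = 1.2.
ub, dub = integrate(0.87, 3.0, 1.2, 0.001, 0.0)

if __name__ == "__main__":
    print("u_a(1.2), u_a'(1.2):", ua, dua)
    print("u_b(1.2), u_b'(1.2):", ub, dub)
```

The printed terminal values can be compared with those quoted in the text for $x_1 = 1.2$; small differences in the last digit are to be expected from a different integration process.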





Matching $u_a$ and $u_b$ at $x_1$. According to Tables 3 and 4, the ratio
of $u_a$ to $u_b$ at $x_1 = 1.2$ is .504/4.010 = .126.
Consequently we can get the solution of (6) that equals .504 at $x_1$
and has a horizontal tangent at $x_2$, the solution denoted by $u_B$,
simply by multiplying $u_b$ by .126; see Fig. 49 and Table 5.

Table 5: $\lambda_1'$ = .87

   x       u_B       u_B'
  1.2      .504     -.947
  3.0     .0001     .0000

An estimate of $u_1'(x_1)$. Although $u_a$ and $u_B$ are equal at $x_1$,
their derivatives at $x_1$, equal respectively to −.541 and −.947,
are different; hence .87 is not the correct value of $\lambda_1$, and we
proceed to improve it. Let $u_1$ be the as yet unknown eigenfunction
of (4) belonging to the lowest eigenvalue and equal to 1 at
x = 0. The slope of $u_1$ at $x_1 = 1.2$ is not available, but an estimate
of it can be made by taking the average of $u_a'(x_1)$ and $u_B'(x_1)$:

$$u_1'(1.2) \approx \tfrac12(-.541 - .947) = -.744. \tag{7}$$

Second trial value of $\lambda_1$. Let $\lambda_1'' = \lambda_1' + \Delta\lambda'$ be the second trial
value of $\lambda_1$, presumably better than $\lambda_1'$, and let $u_c$ be the solution of

$$u'' + (\lambda_1'' - x^4)u = 0 \tag{8}$$

that equals 1 at x = 0 and has a horizontal tangent there. Then

$$u_c'' = (x^4 - \lambda_1')u_c - \Delta\lambda'\,u_c \tag{9}$$

and integrating this between 0 and $x_1$, we get

$$u_c'(x_1) = \int_0^{x_1} (x^4 - \lambda_1')u_c\,dx - \Delta\lambda' \int_0^{x_1} u_c\,dx. \tag{10}$$

Let us now assume that in the region from 0 to $x_1$ the function $u_c$
does not differ much from $u_a$, so that it is permissible to replace
$u_c$ by $u_a$ on the right of (10):

$$u_c'(x_1) \approx \int_0^{x_1} (x^4 - \lambda_1')u_a\,dx - \Delta\lambda' \int_0^{x_1} u_a\,dx. \tag{11}$$

Since $u_a'' = (x^4 - \lambda_1')u_a$, the first integral on the right of (11) is
just $u_a'(x_1)$, namely −.541. The second integral can be obtained
either from the numerical values of $u_a$, not given here, or graphically
from Fig. 49; it equals approximately .90. Equation (11)
thus yields

$$u_c'(1.2) \approx -.541 - .90\,\Delta\lambda'. \tag{12}$$





Using (7), our estimate of the correct value of the derivative at 1.2, to eliminate $u_c'(1.2)$ from (12), we find that $\Delta\lambda' \approx (.744 - .541)/.90 = .23$, so that

$$\lambda_1'' = .87 + .23 = 1.10. \tag{13}$$

Second rightward and leftward integrations. Equation (8) now becomes

$$u'' + (1.10 - x^4)u = 0; \tag{14}$$

we compute in the region from 0 to 1.2 its solution $u_c$ such that $u_c(0) = 1$ and $u_c'(0) = 0$ (Fig. 49 and Table 6), compute in the region from 3.0 to 1.2 its solution $u_d$ such that $u_d(3.0) = .001$ and $u_d'(3.0) = 0$ (Table 7), and then multiply $u_d$ by .366/3.798 = .0964 to get the solution $u_D$ that equals .366 at $x_1 = 1.2$ and has a horizontal tangent at $x_2 = 3.0$ (Fig. 49 and Table 8). The derivatives of $u_c$ and $u_D$ do not match at 1.2 (Tables 6 and 8), so that 1.10 is not the correct value of $\lambda_1$; but as seen from Fig. 49 the curves $u_c$ and $u_D$ join at $x_1 = 1.2$ much more smoothly than do the curves $u_a$ and $u_B$, and hence 1.10 is a decided improvement over the first trial value .87.

Table 6: $\lambda_1'' = 1.10$

   x       u_c       u_c'
  0.0     1.000      .000
  1.2      .366     -.752

Table 7: $\lambda_1'' = 1.10$

   x       u_d       u_d'
  1.2     3.798    -6.952
  3.0      .001      .000

Table 8: $\lambda_1'' = 1.10$

   x       u_D       u_D'
  1.2      .366     -.670
  3.0      .0001     .0000

Third trial value of $\lambda_1$. The respective ratios of the slopes at $x_1 = 1.2$ of the solutions obtained above by the rightward and the leftward integrations are

$$\lambda = .87:\quad u_a'(x_1)/u_B'(x_1) = (-.541)/(-.947) = .571 \tag{15}$$

$$\lambda = 1.10:\quad u_c'(x_1)/u_D'(x_1) = (-.752)/(-.670) = 1.122. \tag{16}$$

For the correct value of $\lambda_1$, the ratio would be 1; so we interpolate between (15) and (16) for the value of $\lambda$ corresponding to the ratio 1, and get

$$\lambda_1''' = \frac{.87(1.122 - 1.000) + 1.10(1.000 - .571)}{1.122 - .571} = 1.049 \tag{17}$$

as our third approximation to $\lambda_1$.
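The two-way integration and the interpolation (17) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`integrate`, `slope_ratio`, `interpolate_to_unit_ratio`, `refine`) are hypothetical, a fourth-order Runge-Kutta integrator stands in for the hand computation of the text, and the bisection in `refine` is a substitute for the repeated interpolation that the text suggests.

```python
# Two-way integration for u'' + (lam - x**4) u = 0, Equation (4) of the text.
# Hypothetical helper names; RK4 replaces the book's hand integration.

def integrate(lam, x_start, x_end, u0, du0, steps=2000):
    """Classical RK4 from x_start to x_end; returns (u, u') at x_end."""
    h = (x_end - x_start) / steps          # negative for leftward integration
    x, u, v = x_start, u0, du0

    def acc(x, u):
        return (x**4 - lam) * u            # u'' = (x^4 - lam) u

    for _ in range(steps):
        k1u, k1v = v, acc(x, u)
        k2u, k2v = v + 0.5 * h * k1v, acc(x + 0.5 * h, u + 0.5 * h * k1u)
        k3u, k3v = v + 0.5 * h * k2v, acc(x + 0.5 * h, u + 0.5 * h * k2u)
        k4u, k4v = v + h * k3v, acc(x + h, u + h * k3u)
        u += h * (k1u + 2 * k2u + 2 * k3u + k4u) / 6
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6
        x += h
    return u, v


def slope_ratio(lam, x1=1.2, x2=3.0):
    """Rightward slope at x1 divided by the rescaled leftward slope at x1."""
    ua, dua = integrate(lam, 0.0, x1, 1.0, 0.0)     # u(0) = 1, u'(0) = 0
    ub, dub = integrate(lam, x2, x1, 0.001, 0.0)    # u(x2) = .001, u'(x2) = 0
    scale = ua / ub                                  # makes the two values agree at x1
    return dua / (scale * dub)


def interpolate_to_unit_ratio(lam_a, ratio_a, lam_b, ratio_b):
    """Linear interpolation for the lam at which the slope ratio is 1, as in (17)."""
    return (lam_a * (ratio_b - 1.0) + lam_b * (1.0 - ratio_a)) / (ratio_b - ratio_a)


def refine(lam_lo, lam_hi, iterations=40):
    """Bisection on slope_ratio(lam) - 1; assumes the two ends bracket ratio 1."""
    for _ in range(iterations):
        mid = 0.5 * (lam_lo + lam_hi)
        if slope_ratio(mid) < 1.0:
            lam_lo = mid
        else:
            lam_hi = mid
    return 0.5 * (lam_lo + lam_hi)


if __name__ == "__main__":
    # Interpolation with the ratios quoted in (15) and (16).
    print(interpolate_to_unit_ratio(0.87, 0.571, 1.10, 1.122))
    # Ratios recomputed numerically, then refined by bisection.
    print(slope_ratio(0.87), slope_ratio(1.10))
    print(refine(0.87, 1.10))
```

With the ratios (15) and (16) the interpolation reproduces the value 1.049 of (17); the bisection simply continues the same matching of slopes at $x_1$ until the trial value settles.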






Concluding remarks. It appears that the lowest eigenvalue of (4) lies between .87 and 1.10, and that the value 1.049 is likely to be a good approximation; further approximations can be made by further two-way integration. If high accuracy is desired, the numerical integration must at an appropriate stage be made correspondingly accurate and the point at which the final leftward integration is begun must be chosen with particular care.^3 As we shall see below, the value of $\lambda_1$, correct to at least four significant figures, is 1.0605, so that the result 1.049 obtained above is in error by about 1 per cent.

Approximate numerical eigenvalues and eigenfunctions of (1) can of course be obtained from those of (4) by means of (2) and (3).

Milne's method.^4 This method is particularly expedient when several eigenvalues and eigenfunctions of a given Schroedinger equation are to be computed. Adapting the results of Exercise 7 of an earlier section to our special equation (4) and setting^5 $x_0 = 0$ and $a = \lambda^{1/2}$, we find that, if $w$ is the solution of the equation

$$w'' + (\lambda - x^4)w = \lambda w^{-3} \tag{18}$$

such that

$$w(0) = 1, \qquad w'(0) = 0, \tag{19}$$

then the condition

$$\frac{\lambda^{1/2}}{\pi}\int_{-\infty}^{\infty} w^{-2}\,dx = n, \qquad n = 1, 2, 3, \cdots \tag{20}$$

is necessary and sufficient in order that $\lambda$ be an eigenvalue of (4). Rough estimates show that $\lambda_1$ and $\lambda_2$ are likely to lie between .5 and 4.5; so we set $\lambda$, in turn, equal to 1, 2, 3, and 4, and for each of these values of $\lambda$ compute numerically the solution (19) of (18), starting at $x = 0$ and ending at a point beyond which $w^{-2}$ is sufficiently small.^6 We then compute the left side of (20) [which we denote by $J$] for each of the four $\lambda$'s, getting Table 9 and Fig. 50. Inspection of these results in the light of (20) shows that the first eigenvalue is a little greater than 1, while the second is a little smaller than 4, and interpolation by means of Table 9 yields $\lambda_1 = 1.0605$ and $\lambda_2 = 3.7998$. Substituting these results into (2), we get the following rather accurate estimates of the two lowest eigenvalues of (1):

$$E_1 = 1.0605\,\kappa^{-2/3}K^{1/3}, \qquad E_2 = 3.7998\,\kappa^{-2/3}K^{1/3}. \tag{21}$$

Milne's method for computing eigenfunctions is outlined in reference 4.

Table 9

   λ        J
   1      .9699
   2     1.3995
   3     1.7487
   4     2.0602

^3 To shorten the preliminary work, the earlier approximations may be obtained without using too large a value of $x_2$.

^4 W. E. Milne, Phys. Rev., 35, 863 (1930).

^5 The choice $x_0 = 0$ causes $w$ to be even in $x$, so that the computations may be restricted to positive values of $x$; the choice $a = \lambda^{1/2}$ then makes $w$ particularly flat at $x = 0$ (Exercise 6), thus simplifying the numerical work still further.

^6 A sample computation is given in detail in reference 4.
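A minimal numerical sketch of the computation behind Table 9 follows. It assumes that (18) reads $w'' + (\lambda - x^4)w = \lambda w^{-3}$ and that $J$ is $(\lambda^{1/2}/\pi)\int w^{-2}\,dx$ taken over all $x$; the function name `milne_J`, the cutoff `x_max`, and the Runge-Kutta step are illustrative choices, not Milne's own scheme.

```python
import math

def milne_J(lam, x_max=4.0, steps=4000):
    """Left side of (20) for u'' + (lam - x**4) u = 0 (assumed form of J).

    Integrates w'' = lam * w**-3 - (lam - x**4) * w with w(0) = 1, w'(0) = 0
    by RK4, accumulating the integral of w**-2 from 0 to x_max, and uses the
    evenness of w to double the half-range integral.
    """
    h = x_max / steps

    def deriv(x, w, v):
        # Returns (w', v', d/dx of the running integral of w**-2).
        return v, lam / w**3 - (lam - x**4) * w, 1.0 / w**2

    x, w, v, integral = 0.0, 1.0, 0.0, 0.0
    for _ in range(steps):
        k1 = deriv(x, w, v)
        k2 = deriv(x + h / 2, w + h / 2 * k1[0], v + h / 2 * k1[1])
        k3 = deriv(x + h / 2, w + h / 2 * k2[0], v + h / 2 * k2[1])
        k4 = deriv(x + h, w + h * k3[0], v + h * k3[1])
        w += h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6
        v += h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6
        integral += h * (k1[2] + 2 * k2[2] + 2 * k3[2] + k4[2]) / 6
        x += h
    return 2.0 * math.sqrt(lam) / math.pi * integral


if __name__ == "__main__":
    for lam in (1.0, 2.0, 3.0, 4.0):
        print(lam, milne_J(lam))
```

Eigenvalues are then the values of $\lambda$ at which `milne_J` passes through 1, 2, and so on, which is the interpolation step described in the text.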

Exercises

1. Explain why we may expect, on the basis of Tables 3 and 5, that $u_1'(1.2)$ lies within rather than outside the limits -.541 and -.947.

2. Suppose the solutions $u_a$ and $u_b$ of (6) and $u_c$ and $u_d$ of (14) to be extended rightward in Fig. 49; then describe the behavior of each for large values of $x$. Note that $.87 < \lambda_1 < 1.10 < \lambda_2$.

3. When dealing with (4) we should choose $x_1$ greater than the fourth root of the trial value of $\lambda$. Why?

4. Outline the steps for the calculation of the second eigenvalue of (4) by two-way integration.

5. Propose a procedure for using two-way integration to compute eigenvalues and eigenfunctions when $V(x)$ is not even in $x$.

6. Show that the first five derivatives of the solution (19) of Equation (18) vanish at $x = 0$.

34. Expansions in Powers of ħ

The next method of approximation to be outlined^7 consists in expressing the solutions of the Schroedinger equation

$$\psi'' + \frac{2m}{\hbar^2}[E - V(x)]\psi = 0 \tag{1}$$

in a form involving power series in $\hbar$ and then neglecting the higher powers of $\hbar$.

We look for solutions of (1) of the form

$$\psi = C e^{i\phi(x)/\hbar}, \tag{2}$$

where $C$ is a constant and $\phi$ is an as yet undetermined function of $x$ which, as seen by substituting (2) into (1), must satisfy the equation

$$i\hbar\phi'' - (\phi')^2 + 2m[E - V(x)] = 0. \tag{3}$$

To get an approximate solution of (3), we express $\phi$ in the form

$$\phi(x) = \phi_0(x) + \hbar\phi_1(x) + \hbar^2\phi_2(x) + \cdots, \tag{4}$$

where the subscripted $\phi$'s are independent of $\hbar$, and assume that on account of the smallness of $\hbar$ the first two terms in (4) give a sufficiently good approximation to $\phi$; a close scrutiny of (3) is of course required to tell just how good this approximation may actually be for various values of $x$. Substituting (4) into (3) and neglecting powers of $\hbar$ higher than the first, we then get the equation

$$\{2m[E - V] - (\phi_0')^2\} + \hbar\{i\phi_0'' - 2\phi_0'\phi_1'\} = 0. \tag{5}$$

In order that (5) may hold identically in $\hbar$, each $\{\ \}$ must vanish separately, so that

$$\phi_0' = \pm\sqrt{2m[E - V]} \tag{6}$$

$$\phi_1' = \tfrac{1}{2}i\,\phi_0''/\phi_0' = \tfrac{1}{2}i\,\frac{d}{dx}\log\phi_0'. \tag{7}$$

Integrating (6), we get $\phi_0 = \pm\int\sqrt{2m[E - V]}\,dx$, that is,

$$\phi_0 = \pm\int_{x_0}^{x}\sqrt{2m[E - V(x)]}\,dx + c_1, \tag{8}$$

where $x_0$ is an arbitrarily fixed value of $x$ and $c_1$ is an arbitrary constant. Integrating (7), we get $\phi_1 = \tfrac{1}{2}i\log\phi_0' + c_2$; this result is inconvenient if $\phi_0'$ is negative, and therefore we replace it by

^7 The method is due to L. Brillouin (1926), H. Jeffreys (1923), H. A. Kramers (1926), and G. Wentzel (1926), and is in quantum-mechanical discussions variously called the J. W. B. K. method, the B. W. K. method, the W. B. K. method, and so on. For a rigorous discussion, extensive references, and cautions concerning this important method, see Kemble, Section 21. For an application of this method to the classical problem of the vibrations of a string of nonuniform density, see J. C. Slater and N. H. Frank, Introduction to Theoretical Physics [hereafter cited as Slater and Frank], Chapter XIV; New York and London: McGraw-Hill Book Co., 1933.

$\phi_1 = \tfrac{1}{2}i\log|\phi_0'| + c_2$, recalling that the logarithm of a negative function differs only by an imaginary constant from the logarithm of the absolute value of the function, and that $c_2$ is an arbitrary constant in any case. Thus we get the approximate equation

$$\phi(x) \approx \phi_0(x) + \tfrac{1}{2}i\hbar\log|\phi_0'(x)|, \tag{9}$$

where $\phi_0$ is given by (8) and $c_2$ is absorbed in $c_1$. Substituting (9) into (2) and rearranging the result, we finally get our approximate solution in the form

$$\psi_{\text{app.}} = C\{2m|E - V(x)|\}^{-1/4}\exp\left\{\pm\frac{i}{\hbar}\int_{x_0}^{x}\sqrt{2m[E - V(x)]}\,dx\right\}, \tag{10}$$

where $C$ is arbitrary. The two solutions contained in (10) and differing in the sign of the exponent are linearly independent, and hence the approximate general solution of (1) is

$$\psi_{\text{app.}} = \{2m|E - V|\}^{-1/4}\left[A\exp\left\{\frac{i}{\hbar}\int_{x_0}^{x}\sqrt{2m[E - V]}\,dx\right\} + B\exp\left\{-\frac{i}{\hbar}\int_{x_0}^{x}\sqrt{2m[E - V]}\,dx\right\}\right], \tag{11}$$

where $A$ and $B$ are arbitrary constants; an alternative form of (11) is

$$\psi_{\text{app.}} = C\{2m|E - V|\}^{-1/4}\cos\left\{\frac{1}{\hbar}\int_{x_0}^{x}\sqrt{2m[E - V]}\,dx + \theta\right\}, \tag{12}$$

where $C$ and $\theta$ are arbitrary constants.

The approximate solutions (11) and (12) of the Schroedinger equation are usually called W. B. K. $\psi$-functions, or B. W. K. $\psi$-functions.



Connection formulas. Consider now the case (Fig. 51) when $V$ is greater than $E$ everywhere in region I lying at the left of $x_1$. Then set $x_0 = x_1$ in (11), and recall that the precise solutions of (1) that are of interest to us are required to be well-behaved. Writing $[E - V]^{1/2} = i[V - E]^{1/2}$ and remembering that $x < x_1$ in region I, we then find that as $x \to -\infty$ the $A$-term in (11) increases without limit while the $B$-term approaches zero. Hence, in order that (11) may approximate a solution of (1) which remains finite when $x \to -\infty$, we must set $A = 0$ and adopt the function

$$\psi_{\text{app.}} = B\{2m[V - E]\}^{-1/4}\exp\left\{\frac{1}{\hbar}\int_{x_1}^{x}\sqrt{2m[V - E]}\,dx\right\} \tag{13}$$

as the appropriate approximation for region I. Incidentally, at $x = x_1$ the function (13) becomes infinite on account of the factor $\{\ \}^{-1/4}$, and hence, although (13) may be a good approximation to a precise solution of (1) for values of $x$ lying sufficiently far to the left of $x_1$, it certainly departs violently from this solution when $x \approx x_1$; this departure is illustrated in Fig. 54, which presents the correct $\psi$-function for the fifth energy state of an oscillator together with a W. B. K. approximation to this function.

In region II of Fig. 51 no limitations are put on the arbitrary constants, and we may take the approximate general solution to have the form (12).

Now, how should the adjustable constants in (12) be chosen in order that (12) may connect properly with (13), that is, in order that (12) may approximate in region II the same exact solution of (1) which (13) approximates in region I? A complicated computation, which we omit, gives the following answer: the approximate solution for region II that connects properly with (13) is

$$\psi_{\text{app.}} = 2B\{2m[E - V]\}^{-1/4}\cos\left\{\frac{1}{\hbar}\int_{x_1}^{x}\sqrt{2m[E - V]}\,dx - \tfrac{1}{4}\pi\right\}, \tag{14}$$

which is obtained from (12) by letting $C = 2B$, $x_0 = x_1$, and $\theta = -\tfrac{1}{4}\pi$. Incidentally, from the elementary standpoint, (13) and (14) do not join at all smoothly at $x_1$, where both are infinite; but both are, of course, poor approximations near $x_1$.

For the case (Fig. 52) when $V$ is greater than $E$ everywhere at the right of $x_2$, it is found that the appropriate approximation for the region III is

$$\psi_{\text{app.}} = A\{2m[V - E]\}^{-1/4}\exp\left\{-\frac{1}{\hbar}\int_{x_2}^{x}\sqrt{2m[V - E]}\,dx\right\} \tag{15}$$

and that the function approximating in region II of Fig. 52 the same exact solution of (1) that (15) approximates in region III is

$$\psi_{\text{app.}} = 2A\{2m[E - V]\}^{-1/4}\cos\left\{\frac{1}{\hbar}\int_{x_2}^{x}\sqrt{2m[E - V]}\,dx + \tfrac{1}{4}\pi\right\}. \tag{16}$$

The bound case. We turn next to the bound case, restricting ourselves to a potential $V$ such that $V'$ vanishes only once, as in Fig. 53. According to theorem 25 of §32, the energy in this case is quantized, a fact that shows up in our present approximate work as follows: On the one hand, $\psi_{\text{app.}}$ must have the form (13) in region I, so that in region II it must have the form (14); on the other hand, $\psi_{\text{app.}}$ must have the form (15) in region III, so that in region II it must have the form (16). Consequently, the proper connections can be made only if the functions (14) and (16) are equal to each other, that is, only if throughout region II we have

$$B\cos\left\{\int_{x_1}^{x} - \tfrac{1}{4}\pi\right\} = A\cos\left\{\int_{x_2}^{x} + \tfrac{1}{4}\pi\right\}, \tag{17}$$

where $\int$ stands for $\frac{1}{\hbar}\int\sqrt{2m[E - V]}\,dx$. Now, differentiating (17) with respect to $x$ and canceling the factor common to both sides of the result, we get an equation precisely like (17), except that the cosines are replaced by sines; and combining this equation with (17), we get

$$\tan\left\{\int_{x_1}^{x} - \tfrac{1}{4}\pi\right\} = \tan\left\{\int_{x_2}^{x} + \tfrac{1}{4}\pi\right\}. \tag{18}$$

Since the period of the tangent is $\pi$, the arguments of the two tangents in (18) must differ by $n\pi$, where $n$ is an integer or zero, and we finally get

$$2\int_{x_1}^{x_2}\sqrt{2m[E - V(x)]}\,dx = (n + \tfrac{1}{2})h, \qquad n = 0, 1, 2, \cdots, \tag{19}$$

where negative $n$'s are suppressed, since the left side of (19) is positive. For a preassigned $V$, Equation (19) is a condition on $E$, and consequently only for particular $E$'s is it possible properly to connect our approximate $\psi$'s. The condition (19) is thus an approximate quantization rule for the energy when the potential has the form shown in Fig. 53.

Examples. To illustrate the use of (19), we shall employ it to estimate the energy levels for the case of the particular potential $V(x) = Kx^4$. The integrand in (19) is now even in $x$, and the classical limits of motion for a particle of energy $E$ are $x_1 = -(E/K)^{1/4}$ and $x_2 = (E/K)^{1/4}$, so that (19) becomes

$$4\int_0^{(E/K)^{1/4}}\sqrt{2m[E - Kx^4]}\,dx = (n + \tfrac{1}{2})h, \qquad n = 0, 1, 2, \cdots. \tag{20}$$

The substitution $y = Kx^4/E$ sends (20) into

$$(2m)^{1/2}E^{3/4}K^{-1/4}\int_0^1 (1 - y)^{1/2}y^{-3/4}\,dy = (n + \tfrac{1}{2})h. \tag{21}$$

The integral in (21) is approximately^8 3.50, so that, writing $E_n$ for $E$ and recalling that $\kappa = 2m/\hbar^2 = 8\pi^2 m/h^2$, we finally get

$$E_n = .87(2n + 1)^{4/3}\kappa^{-2/3}K^{1/3}, \qquad n = 0, 1, 2, \cdots \tag{22}$$

as the approximate formula for the energy levels. The two lowest levels are given by (22) as

$$E_0 = .87\,\kappa^{-2/3}K^{1/3}, \qquad E_1 = 3.77\,\kappa^{-2/3}K^{1/3}. \tag{23}$$

Comparison with $21^{33}$ shows that, when $V = Kx^4$, the formula (19) underestimates the lowest level by about 18 per cent, and the second level by about 1 per cent; this formula usually gives better results for the higher levels.

The B. W. K. $\psi$-function for the fifth energy state of a linear harmonic oscillator is shown in Fig. 54, together with a correct Schroedinger function for this state. The corresponding distribution-in-$x$, $\bar\psi_{\text{app.}}\psi_{\text{app.}}$, is shown in Fig. 55; note that in the inner region this distribution is a compromise between the classical and the correct quantum-mechanical distributions, both of which are shown at the top of the figure.

[Fig. 54. Top: a Schroedinger function. Bottom: the W. B. K. approximation to this function.]

[Fig. 55. Top: a Schroedinger distribution-in-$x$ and the corresponding classical distribution-in-$x$. Bottom: the W. B. K. distribution-in-$x$ for the same state.]

A comparison with the Bohr theory. In the Bohr theory, the energy of a particle moving in the field shown in Fig. 53 is quantized by the Wilson-Sommerfeld rule, which states that $2\int_{x_1}^{x_2} p\,dx = nh$, where $p$ is the momentum of the particle at the point $x$, and $n = 1, 2, 3, \cdots$; this rule can be rewritten as $2\int_{x_1}^{x_2}\sqrt{2m[E - V(x)]}\,dx = nh$. The Wilson-Sommerfeld rule and the in general more accurate approximate rule (19) may be contrasted by saying that the former uses integer quantization and the latter odd half-integer quantization. Note that the Wilson-Sommerfeld rule bears some resemblance to the exact formula 26 of an earlier section.

^8 See, for example, H. B. Dwight, Tables of Integrals and Other Mathematical Data, formula 855.2 and table 1018; New York: The Macmillan Co., 1934. Or B. O. Peirce, A Short Table of Integrals, formula 482 and the table of logarithms of the gamma function; Boston: Ginn and Co.
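The quartic example of the quantization rule (19) can be checked with a short Python sketch. It works in the dimensionless units in which energies are measured in $\kappa^{-2/3}K^{1/3}$ with $\kappa = 8\pi^2 m/h^2$; the function names are hypothetical, and the reference values 1.0605 and 3.7998 are the Milne results quoted in §33.

```python
import math

def beta_integral():
    """Integral of (1 - y)**(1/2) * y**(-3/4) over 0..1, i.e. B(1/4, 3/2)."""
    return math.gamma(0.25) * math.gamma(1.5) / math.gamma(1.75)


def beta_integral_numeric(n=200000):
    """Same integral by the midpoint rule after the substitution y = s**4."""
    h = 1.0 / n
    return 4.0 * h * sum(math.sqrt(1.0 - ((i + 0.5) * h) ** 4) for i in range(n))


def wkb_level(n):
    """E_n in units of kappa**(-2/3) * K**(1/3) from the rule (19) for V = K x**4."""
    return ((2 * n + 1) * math.pi / beta_integral()) ** (4.0 / 3.0)


if __name__ == "__main__":
    print(beta_integral(), beta_integral_numeric())
    for n, reference in ((0, 1.0605), (1, 3.7998)):
        level = wkb_level(n)
        # Fractional amount by which (19) falls below the reference value.
        print(n, level, (reference - level) / reference)
```

The leading coefficient $(\pi/3.50)^{4/3}$ is what appears as .87 in the formula for $E_n$, and the printed fractions are the underestimates discussed in the text.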


Exercises

1. Show that (19) yields (fortuitously) the correct energy levels for the harmonic oscillator. Remember Fig. 54, however.

2. Show that, if $V = c|x|$, where $c$ is a positive constant, then the approximate energy levels given by (19) are $E_n = [3c(n + \tfrac{1}{2})h/8\sqrt{2m}]^{2/3}$.

particle Saw"™; Wilson-Sommerfeld rule to a 

particle m a box and compare the results vith 5»» 

ia ^ and shorv that the factor ( }-i 

factor that is distribution-in-., ^ 

tberenminbgLctor^n'rrl^"^^ 

17“, and show (essentiall ' p®*®“**®'* with a slanting bottom, Fig. 

W. B. K. methld gives aVulUnTTf 
aponding classical result of Exercise Ti' 

for the lower energies beoaim. f i ’ ^ ^he agreement 

aweliable; but see Exercise 8»*. *’® 

does not invalida^ Piiysioal dimensions of in (7) 





35. A Variation Method^9

In preparation for the next method of approximation, we shall say a word about ordinary curve fitting. If a given curve (say a trace made by a recording instrument), whose correct equation $y = F(x)$ is either unknown or inconveniently complicated, happens to be approximately straight, it is often expedient to write $y = ax + b$, to adjust the numerical values of the constants $a$ and $b$ so as to obtain a 'best fit' to the given curve, and then to use the linear equation as an approximation to the correct one. Similarly, if the given curve resembles a parabola, the function $y = ax^2 + bx + c$ may be profitably fitted to it; in general, if a given curve has a shape resembling that of some simple function, it is sometimes expedient to adjust the numerical constants in this function so as to fit the given curve best, and then to use the resulting function as an approximation to the given curve. The process of curve fitting consists essentially of three steps:

I, to choose a function $f(x)$ that has the appropriate general shape and contains adjustable constants $a, b, \cdots$,

II, to decide on a criterion of 'best fit,' and

III, to adjust the constants $a, b, \cdots$, in $f(x)$ so as to obtain the best fit.

In its most elementary aspect, which alone we shall consider, the variation method for approximate solution of a one-dimensional first Schroedinger equation can be viewed as essentially a process of curve fitting, consisting of the steps I, II, and III enumerated above. To be sure, in ordinary curve fitting we fit a chosen function to a given curve, while in the variation method we fit a chosen function to an unknown solution of a given differential equation; the underlying ideas of the two types of computations have, nevertheless, much in common. In describing the method, we shall restrict ourselves to approximating the lowest eigenvalue $E_0$ and the corresponding normalized eigenfunction $\psi_0$ of the Schroedinger equation pertaining to bound motion.

I. The choice of the approximating $\psi$. Our first step is to write down a function $\psi$ that has the same general shape as the unknown $\psi_0$ and contains some adjustable constants $a, b, \cdots$. This step is feasible because the theorems of §32 enable us at once to enumerate several important features of the unknown $\psi_0$. Thus $\psi_0$, being the lowest eigenfunction for a bound motion, can be taken as real, vanishes when $|x| \to \infty$, has no nodes at finite values of $x$ so that it can be taken as everywhere positive, and has one maximum and no minima; if $V(x)$ happens to be even in $x$, then $\psi_0$ is also even in $x$. A person familiar with the shapes of $\psi$-functions in a variety of special cases could perhaps infer additional information about $\psi_0$ by inspecting the given $V(x)$ more closely. We shall take it for granted in the sequel that $\psi$ is normalized.

^9 For important ramifications and extensions of this method (which, from the mathematical standpoint, is best approached through the calculus of variations) and for references see, for example, Pauling and Wilson, ...



To illustrate: If $V = Kx^4$, say, then $\psi_0$ vanishes when $|x| \to \infty$, has no nodes at finite values of $x$, is even in $x$, and, if taken as positive, has a maximum at $x = 0$. Several simple functions of this general shape come to mind, among them the function $\psi = N(1 + a^2x^2)^{-2}$, where $N$ is the normalizing factor and $a$ is a single adjustable constant that we shall take to be positive; written out in detail, this function is

$$\psi = 4\left(\frac{a}{5\pi}\right)^{1/2}(1 + a^2x^2)^{-2}; \tag{1}$$

graphs of (1) for three values of $a$ are shown in Fig. 56. Thus, for example, in dealing with the case $V = Kx^4$, we may adopt (1) as the function to be fitted^10 (by an adjustment of $a$) to the unknown $\psi_0$.

[Fig. 56. Graphs of the function (1).]

Extraordinary coincidences excepted, the approximating $\psi$ chosen in this manner is of course not precisely equal to the unknown $\psi_0$ even for special values of the adjustable constants; instead, $\psi$ is a superposition of the various unknown instantaneous Schroedinger eigenfunctions of $H$. But, since $\psi$ has substantially the same shape as $\psi_0$, $\psi_0$ appears in this superposition particularly prominently; in other words, the approximating $\psi$ represents instantaneously a state for which the energy of the system is not a certainty, but is particularly likely to be $E_0$.

^10 Incidentally, the function (1) would be a rather bad approximating $\psi$ when $V = Kx^4$, because it does not decrease sufficiently fast when $|x|$ grows large. Indeed, for large values of $|x|$, Equation $1^{33}$ becomes $\psi'' \approx \kappa Kx^4\psi$; for large values of $|x|$, approximate solutions of the latter equation are $x^{\gamma}e^{-\beta|x|^3/3}$, where $\gamma$ is arbitrary and $\beta^2 = \kappa K$; the function (1), on the other hand, behaves for large values of $|x|$ as $x^{-4}$.

II. The criterion of best fit used in approximating the lowest eigenfunction by the variation method can be set up as follows: Let $E_0$ be the lowest energy level of the system (that is, the lowest possible result of a precise measurement of the energy of the system). Let av $E$ be the expected average of the energy (that is, the probability-weighted arithmetic average of the possible results of a precise energy measurement) of this system for some state that is either the normal state, or a higher energy state, or, in general, a superposition of the various energy states. It is then perhaps obvious (if not, recall Exercise 17 of an earlier section) that av $E \ge E_0$. If the state in question is specified by a Schroedinger function $\psi$, so that we may replace the symbol av $E$ by av$_\psi E$, this result takes the form

$$\text{av}_\psi E \ge E_0. \tag{2}$$

The = sign in (2) applies if and only if $\psi$ is the Schroedinger function specifying the normal state of the system.

Now, if $\psi$ is normalized, then, according to the Schroedinger expectation formula,

$$\text{av}_\psi E = \int\bar\psi H\psi\,dx, \tag{3}$$

where $H$ is the Hamiltonian operator of the system; and, combining (3) with (2), we conclude that

$$\int\bar\psi H\psi\,dx \ge E_0, \tag{4}$$

whenever $\psi$ is normalized. The relation (4) holds, of course, whether $\psi$ is an instantaneous or a time-dependent $\psi$-function.

The relation (4), taken together with its special case

$$\int\bar\psi_0 H\psi_0\,dx = E_0, \tag{5}$$

suggests the following criterion of good fit, adopted in the variation method: of two normalized $\psi$'s, the one for which the integral $\int\bar\psi H\psi\,dx$ has the lesser numerical value is the better approximation to $\psi_0$.





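III. Adjustment of the constants in $\psi$. Substitution of the approximating $\psi$ into $\int\bar\psi H\psi\,dx$ yields a number involving the adjustable constants $a, b, \cdots$, appearing in $\psi$. According to the criterion quoted above, to make $\psi$ fit the unknown $\psi_0$ best, we must choose these constants so as to make $\int\bar\psi H\psi\,dx$ as small as possible. Thus our problem reduces to that of minimizing a function of $a, b, \cdots$, so that, to determine the best values of $a, b, \cdots$, we need only to solve simultaneously the equations

$$\frac{\partial}{\partial a}\int\bar\psi H\psi\,dx = 0, \qquad \frac{\partial}{\partial b}\int\bar\psi H\psi\,dx = 0, \qquad \cdots. \tag{6}$$

To illustrate, we return to the potential $V = Kx^4$ and to the approximating function (1). We have

$$\int\bar\psi H\psi\,dx = \int\psi\left[-\frac{1}{\kappa}\frac{d^2}{dx^2} + Kx^4\right]\psi\,dx = \frac{16a}{5\pi\kappa}\int_{-\infty}^{\infty}\left[\frac{4a^2}{(1 + a^2x^2)^5} - \frac{24a^4x^2}{(1 + a^2x^2)^6} + \frac{\kappa Kx^4}{(1 + a^2x^2)^4}\right]dx, \tag{7}$$

and evaluation of the integral yields

$$\int\bar\psi H\psi\,dx = \frac{7a^2}{5\kappa} + \frac{K}{5a^4}. \tag{8}$$

Thus

$$\frac{d}{da}\int\bar\psi H\psi\,dx = \frac{14a}{5\kappa} - \frac{4K}{5a^5}, \tag{9}$$

so that the value of $a$ that minimizes $\int\bar\psi H\psi\,dx$ satisfies the equation $14a/5\kappa - 4K/5a^5 = 0$, whose positive solution is

$$a = (2\kappa K/7)^{1/6} \approx .81\,\kappa^{1/6}K^{1/6}. \tag{10}$$

Substitution of (10) into (1) now yields that function which, among all functions of the form (1) with a positive $a$, is (according to the criterion adopted above) the best approximation to $\psi_0$ for the case $V = Kx^4$; when just two significant figures are kept in the numerical coefficients, this function is

$$\psi = .91\,(\kappa K)^{1/12}\left[1 + .66\,(\kappa K)^{1/3}x^2\right]^{-2}. \tag{11}$$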
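The minimization for the trial function (1) can be verified numerically. The sketch below takes $\kappa = K = 1$, so that energies come out in units of $\kappa^{-2/3}K^{1/3}$, and assumes the closed form $7a^2/5\kappa + K/5a^4$ for the integral $\int\bar\psi H\psi\,dx$; the helper names, the Simpson quadrature, and the golden-section search are illustrative choices rather than anything prescribed by the text.

```python
import math

def closed_form(a):
    """Assumed closed form of the integral of psi*H*psi for kappa = K = 1."""
    return 7.0 * a**2 / 5.0 + 1.0 / (5.0 * a**4)


def quadrature(a, x_max=60.0, n=40000):
    """Simpson estimate of the same integral, using evenness of the integrand."""
    norm_sq = 16.0 * a / (5.0 * math.pi)            # N**2 for psi = N / (1 + a^2 x^2)^2

    def integrand(x):
        q = 1.0 + a * a * x * x
        kinetic = 4.0 * a**2 / q**5 - 24.0 * a**4 * x * x / q**6   # -psi * psi'' / N^2
        potential = x**4 / q**4                                     # x^4 * psi^2 / N^2
        return norm_sq * (kinetic + potential)

    h = x_max / n                                    # n must be even
    total = integrand(0.0) + integrand(x_max)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return 2.0 * total * h / 3.0                     # factor 2 covers negative x


def minimize(f, lo=0.2, hi=3.0, iterations=80):
    """Golden-section search; assumes f has a single minimum on [lo, hi]."""
    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    c, d = hi - ratio * (hi - lo), lo + ratio * (hi - lo)
    for _ in range(iterations):
        if f(c) < f(d):
            hi, d = d, c
            c = hi - ratio * (hi - lo)
        else:
            lo, c = c, d
            d = lo + ratio * (hi - lo)
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    a_best = minimize(closed_form)
    print(a_best, closed_form(a_best), quadrature(a_best))
```

The search lands on the value of $a$ given by (10), and the minimum of the closed form is the upper limit to $E_0$ that this family of trial functions can supply.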

The approximate value of $E_0$. Since the relation

$$E_0 \le \int\bar\psi H\psi\,dx \tag{12}$$

holds for any normalized $\psi$ representing instantaneously a state of the system, we can get an upper limit to $E_0$ by substituting any normalized $\psi$ into the right side of (12). In particular, having chosen a $\psi$ of a certain form and having minimized the right side of (12) according to our criterion of best fit, we get as low an upper limit to $E_0$ as it is possible to obtain with this form of $\psi$.

For example, in the case $V = Kx^4$, it follows from (12) and (8) that

$$E_0 \le \frac{7a^2}{5\kappa} + \frac{K}{5a^4} \tag{13}$$

for any real $a$, and, reducing the right side of (13) to a minimum by using $a$ from (10), we get

$$E_0 \le \tfrac{21}{10}\left(\tfrac{2}{7}\right)^{1/3}\kappa^{-2/3}K^{1/3} \approx 1.38\,\kappa^{-2/3}K^{1/3}. \tag{14}$$

Thus we now know for certain that in the case $V = Kx^4$ the value of $E_0$ (which in view of theorem 25 of §32 is positive) is not greater than $1.38\,\kappa^{-2/3}K^{1/3}$ and that the upper limit $1.38\,\kappa^{-2/3}K^{1/3}$ is as low an upper limit as can be obtained by means of an approximating $\psi$ of the form (1). Since functions of the form (1) are not suited particularly well^10 for approximating $\psi_0$ when $V = Kx^4$, we may expect that the upper limit to $E_0$ set by (14) is rather high; comparison of (14) with our previous result (note the difference in labeling)

$$(21^{33}) \qquad E_0 = 1.0605\,\kappa^{-2/3}K^{1/3} \tag{15}$$

substantiates this surmise.

If the approximating $\psi$ is chosen with particular care (which was not the case in our illustrative computations for the case $V = Kx^4$), we may sometimes be confident that the upper limit to $E_0$ given by the variation method lies very close to $E_0$; under these circumstances, the upper limit itself, that is, the minimum of $\int\bar\psi H\psi\,dx$, can be taken as a good approximation to $E_0$.


The formal proof of (4), in which we do not make use of physical considerations, runs as follows: We imagine the normalized function $\psi$ to be expanded in terms of the normalized instantaneous eigenfunctions of $H$:

$$\psi = \sum_i c_i\psi_i, \qquad i = 0, 1, 2, \cdots. \tag{16}$$

Operating on (16) with $H$, multiplying through by $\bar\psi$, and integrating, we get, in succession, the following equations: $H\psi = \sum_i c_iH\psi_i = \sum_i c_iE_i\psi_i$, $\bar\psi H\psi = \sum_i c_iE_i\bar\psi\psi_i = \sum_i c_iE_i(\sum_j\bar c_j\bar\psi_j)\psi_i = \sum_i\sum_j E_i\bar c_jc_i\bar\psi_j\psi_i$, and $\int\bar\psi H\psi\,dx = \sum_i\sum_j E_i\bar c_jc_i\int\bar\psi_j\psi_i\,dx$. In view of the orthonormality of the $\psi_i$'s, it follows that $\int\bar\psi H\psi\,dx = \sum_i E_i|c_i|^2$, so that, since $E_0$ is smaller than any other $E_i$, we get $\int\bar\psi H\psi\,dx \ge E_0\sum_i|c_i|^2$, where the = sign is retained to allow for the possibility that all the $c$'s except $c_0$ vanish in (16). The relation (4) now follows from the normalization of $\psi$, which requires that $\sum_i|c_i|^2 = 1$.

Exercises

1. Make a graph comparing (11) with the smoother curve in Fig. $49^{33}$, which, as we know, is a good approximation to $\psi_0$ when $V = Kx^4$. Remember the matter of normalization, and the fact that $x$ in Fig. 49 stands for the variable $3^{33}$.

2. Explain why, in dealing with $\psi_0$ when $V = Kx^4$, we may expect the function $Ne^{-cx^2}$, where $c$ is an adjustable constant, to be a better approximating $\psi$ than (1).

3. Show that the function $Ne^{-cx^2}$ best fits $\psi_0$ for the case $V = Kx^4$ when $c = (3\kappa K/8)^{1/3}$ and that $E_0 < 1.09\,\kappa^{-2/3}K^{1/3}$. Is the surmise of Exercise 2 correct?

4. Show that, when $V = Kx^4$, then the inexactitude of the energy for the state described by the function $Ne^{-cx^2}$, with $c = (3\kappa K/8)^{1/3}$, is approximately ... What is the inexactitude of the energy for a state described by a $\psi$-function of the form (1) when $V = Kx^4$?

5. If the function $\psi = Ne^{-cx^2}$ should be used to fit, by the variation method, the lowest eigenfunction for a linear harmonic oscillator, what would the best value of $c$ and the minimal value of $\int\bar\psi H\psi\,dx$ turn out to be? Verify by an explicit computation.

6. Show that, if the motion is bound, if $V$ is even in $x$, and if the normalized approximating $\psi$ is odd in $x$, then $\int\bar\psi H\psi\,dx \ge E_1$, where $E_1$ is the next to the lowest energy level.

7. Show that, when it is inconvenient to normalize the approximating $\psi$ before evaluating the adjustable constants,^11 then the equations $\partial(\int\bar\psi H\psi\,dx/\int\bar\psi\psi\,dx)/\partial a = 0$, $\partial(\int\bar\psi H\psi\,dx/\int\bar\psi\psi\,dx)/\partial b = 0$, and so forth, may be substituted for Equations (6).

36. A Perturbation Method for Nondegenerate States

If the potential V differs only a little from a potential 7^ so 
that the term in the equation 

V = ( 1 ) 

is small/^ then, we may call 7, 7°, and 7^ the perturbed^ the 
unperturbed, and the perturbing respectively; the energy 

levels and the corresponding V'-functions (to be denoted by Eq , 
Ei,E 2 f • - • and , ^ 2 , • — ) for the potential V may be called 


^11 For example, when the approximating $\psi$ is so complicated as to necessitate numerical evaluation of the various integrals.

^12 For the present, the terms a little and small will be used vaguely.





the perturhed levels and ^-functions, and we may refer to the 
levels and the corresponding ^-functions (to be denoted by 
, JS? , , • • • and , 4 * 1 , > ’ ■ ■) for the potential as the 

unperturbed levels and ^functions. Further, we may then expect 
that each of the perturbed E’s and ^’s differs but little from the 
corresponding unperturbed quantity; in particular, if the per- 
turbed potential has the form 

7 = 7 “ 4 - + ■■■ , ( 2 ) 

where € is a small parameter and the are functions not involving 
€, then we may expect that 

= El -I- ee\ -(- -h -!-•••, (3) 

where the e's are constants independent of e, and that 

-t- efl -h 6*/? + + • • ■ . (4) 

where the fa are functions independent of e. 


To illustrate Equations (2), (3), and (4); we consider the normal state 
of a linear harmonic oscillator (restoring constant k) perturbed by the 
potential so that 

7« = V = ikx^ + i(kx% (5) 

and assume that — 1 < e < 1. In terms of our standard symbols vc and cLj 
and the temporary abbreviation c* the lowest level of the unper- 

^ turbed system is 

(7“) El = ihp. = icni, (6) 


and the corresponding normalized i^-function is^* 


(1218) 






e 2\a/ 






a* . 

2c2 


(7) 


Now, the perturbed potential being F = i(fe + ek)x^, the perturbed system 
is simply a linear harmonic oscillator with restoring constant /c(l H- e), 
so that Bo and can be obtained from (6) and (7) by merely substituting 
k(l +• e) for k: 

E, = ic^kKl + e)* = ihycd + e)‘, (8) 


and 

ipo = 


*‘(1 + e )‘ 

TT— exp 


fe*(l + 0* 
2c* 




(1 + «)* 


exp 




(1 + 

2«» 



” In treating the case (5), we restrict ourselves, for simplicity, to 
instantaneous and set the various arbitrary phases equal to zero, that 
is, set the phase factors equal to unity. 



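To complete our illustration, we must show that (8) and (9) can be put in the respective forms (3) and (4). To do this, we write $(1 + \epsilon)^{1/2} = 1 + \tfrac{1}{2}\epsilon - \tfrac{1}{8}\epsilon^2 + \tfrac{1}{16}\epsilon^3 - \cdots$ in (8) and get

$$E_0 = \tfrac{1}{2}h\nu_c + \epsilon[\tfrac{1}{4}h\nu_c] + \epsilon^2[-\tfrac{1}{16}h\nu_c] + \epsilon^3[\tfrac{1}{32}h\nu_c] + \cdots. \tag{10}$$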
•In a similar way/* we get from (9) the series 


lAo — ^5 + e 




•II + 


( 11 ) 


In the example just given, the perturbed system is so simple that its precise $E$'s and $\psi$'s can be readily computed even without reference to the unperturbed system. It sometimes comes about, however, that, while the unperturbed system admits of precise treatment, the perturbed system does not; in such cases we turn to the so-called perturbation theory, designed for the approximate computation of the perturbed $E$'s and $\psi$'s in terms of the perturbing potential and the unperturbed $E$'s and $\psi$'s. This theory is rather extensive. We shall confine ourselves to one of its simplest and most useful forms, the Rayleigh-Schroedinger method, and shall restrict ourselves still further to cases when the motion is bound and the $n$th energy level of the unperturbed system is nondegenerate. (For bound one-dimensional motion, the entire energy spectrum is nondegenerate; see theorem 25 of §32.)

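It is advantageous to elaborate our notation by writing
$\epsilon v^{I} = V^{I}$, $\epsilon^2 v^{II} = V^{II}$, and so on, $\epsilon e_n^{I} = E_n^{I}$, $\epsilon^2 e_n^{II} = E_n^{II}$, and so on, and
$\epsilon f_n^{I} = \psi_n^{I}$, $\epsilon^2 f_n^{II} = \psi_n^{II}$, and so on,† that is, by using the following
equivalent pairs of series for $V$, $E_n$, and $\psi_n$:

$$\begin{aligned} V &= V^0 + \epsilon v^{I} + \epsilon^2 v^{II} + \epsilon^3 v^{III} + \cdots \\ &= V^0 + V^{I} + V^{II} + V^{III} + \cdots \end{aligned} \tag{12}$$

$$\begin{aligned} E_n &= E_n^0 + \epsilon e_n^{I} + \epsilon^2 e_n^{II} + \epsilon^3 e_n^{III} + \cdots \\ &= E_n^0 + E_n^{I} + E_n^{II} + E_n^{III} + \cdots \end{aligned} \tag{13}$$

$$\begin{aligned} \psi_n &= \psi_n^0 + \epsilon f_n^{I} + \epsilon^2 f_n^{II} + \epsilon^3 f_n^{III} + \cdots \\ &= \psi_n^0 + \psi_n^{I} + \psi_n^{II} + \psi_n^{III} + \cdots \end{aligned} \tag{14}$$

* Taylor-Maclaurin theorem: $f(\epsilon) = f(0) + \epsilon f'(0) + \tfrac{1}{2!}\epsilon^2 f''(0) + \cdots$.

† For example, in (5) we have $v^{I} = \tfrac12kx^2$, $v^{II} = v^{III} = \cdots = 0$ and $V^{I} = \tfrac12\epsilon kx^2$,
$V^{II} = V^{III} = \cdots = 0$; in (10) we have $e_0^{I} = \tfrac14h\nu_c$, $e_0^{II} = -\tfrac1{16}h\nu_c$,
and so on, and $E_0^{I} = \tfrac14\epsilon h\nu_c$, $E_0^{II} = -\tfrac1{16}\epsilon^2h\nu_c$, and so on.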



201 APPROXIMATE COMPUTATIONS §36 

The term $V^{I}$ is called the linear (or first-order) perturbing term,
$E_n^{I}$ the linear (or first-order) correction to the $n$th energy level,
$\psi_n^{I}$ the linear (or first-order) correction to the $n$th $\psi$-function,
$V^{II}$ the quadratic (or second-order) perturbing term, and so on.

Our problem, then, is to compute the various corrections in
terms of the available quantities (that is, in terms of the unperturbed
$E$'s and $\psi$'s, the parameter $\epsilon$, and the $v$'s), it being
known that the unperturbed and the perturbed $\psi$'s, which we shall
take to be normalized, satisfy respectively the Schroedinger equations

$$\frac{d^2\psi_n^0}{dx^2} + \kappa(E_n^0 - V^0)\psi_n^0 = 0, \qquad \frac{d^2\psi_n}{dx^2} + \kappa(E_n - V)\psi_n = 0. \tag{15a, b}$$

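The linear energy correction. Substitution of (12), (13), and
(14) into (15b) yields

$$\left[\frac{d^2}{dx^2} + \kappa\left(E_n^0 + \epsilon e_n^{I} + \epsilon^2 e_n^{II} + \cdots - V^0 - \epsilon v^{I} - \epsilon^2 v^{II} - \cdots\right)\right]\left(\psi_n^0 + \epsilon f_n^{I} + \epsilon^2 f_n^{II} + \cdots\right) = 0; \tag{16}$$

multiplying this out and collecting like powers of $\epsilon$, we write (16) as

$$\{\ \} + \epsilon\{\ \} + \epsilon^2\{\ \} + \cdots = 0; \tag{17}$$

the first { } in (17) is given in full on the left of (18), and the second
on the left of (19). Since (17) should hold identically in $\epsilon$, each
{ } must vanish separately, and we get the series of equations

$$\left[\frac{d^2}{dx^2} + \kappa(E_n^0 - V^0)\right]\psi_n^0 = 0, \tag{18}$$

$$\left[\frac{d^2}{dx^2} + \kappa(E_n^0 - V^0)\right]f_n^{I} + \kappa(e_n^{I} - v^{I})\psi_n^0 = 0, \tag{19}$$

and so on. Equation (18) is just (15a), and yields nothing new.
Equation (19) contains two unknowns, $f_n^{I}$ and $e_n^{I}$, of which $f_n^{I}$
can be eliminated as follows: We imagine $f_n^{I}$ to be expanded in
terms of the unperturbed $\psi$'s, so that

$$f_n^{I} = \sum_i c_i\psi_i^0, \qquad i = 0, 1, 2, \ldots, \tag{20}$$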




and, substituting (20) into (19), we get the equation

$$\sum_i c_i\left[\frac{d^2\psi_i^0}{dx^2} + \kappa(E_n^0 - V^0)\psi_i^0\right] + \kappa(e_n^{I} - v^{I})\psi_n^0 = 0;$$

according to (15a), $d^2\psi_i^0/dx^2 - \kappa V^0\psi_i^0 = -\kappa E_i^0\psi_i^0$, and hence the last equation becomes

$$\sum_i c_i(E_n^0 - E_i^0)\psi_i^0 + (e_n^{I} - v^{I})\psi_n^0 = 0. \tag{21}$$

We now multiply (21) by $\bar\psi_n^0$ and integrate from $x = -\infty$
to $x = \infty$. Because of the orthogonality of the unperturbed
$\psi$'s, the terms in the summation in (21) for which $i \ne n$ then
vanish, while, on account of the factor $(E_n^0 - E_i^0)$, the term
with $i = n$ is zero even before the integration. Thus (21) reduces to
$\int\bar\psi_n^0(e_n^{I} - v^{I})\psi_n^0\,dx = 0$, that is, to $e_n^{I} = \int\bar\psi_n^0 v^{I}\psi_n^0\,dx$, $f_n^{I}$ having
been eliminated. Multiplying the last equation through by $\epsilon$,
we get

$$E_n^{I} = \int\bar\psi_n^0 V^{I}\psi_n^0\,dx. \tag{22}$$

Thus the linear energy correction for a nondegenerate state is equal
to the linear perturbing term averaged over the unperturbed state.
Incidentally, the result (22) is the quantum-mechanical analogue
of the classical perturbation theorem which states that the linear
energy correction for a nondegenerate orbit is equal to the linear
perturbing term averaged, with respect to the time, over the
unperturbed orbit.

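To illustrate (22), we return to the case (5). Here $V^{I} = \tfrac12\epsilon kx^2$, so that
(22) yields

$$E_0^{I} = \tfrac12\epsilon k\int\psi_0^0\,x^2\,\psi_0^0\,dx, \tag{23}$$

where $\psi_0^0$ is (7). The integral in (23) is listed in 18^® as $E_0^0/k$, and consequently

$$E_0^{I} = \tfrac12\epsilon kE_0^0/k = \tfrac14\epsilon h\nu_c. \tag{24}$$

Thus the result given by (22) is just that required by (10).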
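The integral (23) can also be evaluated by direct numerical quadrature. The sketch below is a minimal illustration under stated assumptions: the ground-state function is taken in the Gaussian form $\pi^{-1/4}a^{-1/2}e^{-x^2/2a^2}$, the relation $ka^2 = h\nu_c$ is used to express the answer, and the helper names are ours.

```python
import math

def psi0(x, a=1.0):
    """Normalized ground-state function: pi^(-1/4) a^(-1/2) exp(-x^2 / 2a^2)."""
    return math.pi ** -0.25 * a ** -0.5 * math.exp(-0.5 * (x / a) ** 2)

def integrate(f, lo=-12.0, hi=12.0, n=4000):
    """Composite trapezoidal rule; adequate for rapidly decaying integrands."""
    step = (hi - lo) / n
    total = 0.5 * (f(lo) + f(hi))
    for j in range(1, n):
        total += f(lo + j * step)
    return total * step

def linear_correction(eps, k=1.0, a=1.0):
    """E_0^I from (22) with V^I = (1/2) eps k x^2."""
    return integrate(lambda x: psi0(x, a) * 0.5 * eps * k * x * x * psi0(x, a))

if __name__ == "__main__":
    # With k = a = 1 we have h nu_c = k a^2 = 1, so (24) predicts eps / 4.
    print(linear_correction(0.2), 0.2 / 4.0)
```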

The linear $\psi$-correction. To determine $f_n^{I}$, it is sufficient to
compute the $c$'s in (20); so we multiply (21) by $\bar\psi_j^0$ with $j \ne n$,
integrate from $x = -\infty$ to $x = \infty$, and, remembering the
orthonormality of the unperturbed $\psi$'s, get the equation
$c_j(E_n^0 - E_j^0) - \int\bar\psi_j^0 v^{I}\psi_n^0\,dx = 0$, which, with $i$ written for $j$,
yields the formula

$$c_i = \frac{\int\bar\psi_i^0 v^{I}\psi_n^0\,dx}{E_n^0 - E_i^0}, \qquad i \ne n, \tag{25}$$

so that every $c$ except $c_n$ has been evaluated. The computation
of $c_n$ can be based on the normalization of $\psi_n^0$ and $\psi_n$, and will
be taken up in Exercise 9; the result is

$$c_n = 0. \tag{26}$$

We now multiply (20) through by $\epsilon$ and, using (25) and (26),
get the formula

$$\psi_n^{I} = {\sum_i}'\,\frac{\int\bar\psi_i^0 V^{I}\psi_n^0\,dx}{E_n^0 - E_i^0}\,\psi_i^0, \qquad i = 0, 1, 2, \ldots, \tag{27}$$

where $\sum$ is primed to indicate that the term with $i = n$ is to
be omitted in the summation. Thus the linear $\psi$-correction has
been expressed in terms of the available quantities.


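To illustrate, we shall use (27) to compute $\psi_0^{I}$ for the case (5). Here
$V^{I} = \tfrac12\epsilon kx^2$, so that

$$\psi_0^{I} = {\sum_i}'\,\frac{\int\psi_i^0\,\tfrac12\epsilon kx^2\,\psi_0^0\,dx}{E_0^0 - E_i^0}\,\psi_i^0 = \tfrac12\epsilon k\,{\sum_i}'\,\frac{\int\psi_i^0\,x^2\,\psi_0^0\,dx}{E_0^0 - E_i^0}\,\psi_i^0. \tag{28}$$

According to 43*, the integrals in (28) all vanish except those with $i = 0$
and $i = 2$; the integral with $i = 2$ is $a^2/\sqrt2$, and the integral with $i = 0$
is not needed in (28), as indicated by the primed $\sum$. Hence (28) reduces to

$$\psi_0^{I} = \tfrac12\epsilon k\,\frac{a^2/\sqrt2}{E_0^0 - E_2^0}\,\psi_2^0. \tag{29}$$

Since $E_0^0 - E_2^0 = \tfrac12h\nu_c - \tfrac52h\nu_c$, we finally get, taking $a$ from 7“,

$$\psi_0^{I} = -\frac{\epsilon}{4\sqrt2}\,\psi_2^0, \tag{30}$$

in agreement with (11).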
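The first-order coefficient can be cross-checked by differentiating the exact perturbed ground-state function with respect to $\epsilon$ at $\epsilon = 0$. The sketch below is illustrative only: it assumes the Gaussian form $(1+\epsilon)^{1/8}\pi^{-1/4}a^{-1/2}\exp[-(1+\epsilon)^{1/2}x^2/2a^2]$ for the perturbed state and the standard normalized oscillator function for $n = 2$; the function names are ours.

```python
import math

def psi0_exact(x, eps, a=1.0):
    """Perturbed ground state, written in terms of a."""
    s = math.sqrt(1.0 + eps)
    return ((1.0 + eps) ** 0.125 * math.pi ** -0.25 * a ** -0.5
            * math.exp(-0.5 * s * (x / a) ** 2))

def psi2_unperturbed(x, a=1.0):
    """Normalized unperturbed oscillator function with n = 2."""
    xi = x / a
    return ((2.0 * xi * xi - 1.0) / math.sqrt(2.0)
            * math.pi ** -0.25 * a ** -0.5 * math.exp(-0.5 * xi * xi))

def first_order_coefficient(x, a=1.0, d=1e-6):
    """Central-difference estimate of d(psi_0)/d(eps) at eps = 0."""
    return (psi0_exact(x, d, a) - psi0_exact(x, -d, a)) / (2.0 * d)

if __name__ == "__main__":
    for x in (0.0, 0.7, 1.5):
        # The two columns should agree: derivative vs. -psi_2^0 / (4 sqrt 2).
        print(first_order_coefficient(x), -psi2_unperturbed(x) / (4.0 * math.sqrt(2.0)))
```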


The quadratic energy correction. The equations obtained by
setting each { } in (17) equal to zero form an infinite set which
is sufficient for the computation, one by one, of the higher-order
corrections in (13) and (14) in much the same way that $E_n^{I}$ and
$f_n^{I}$ were computed above, although the procedure becomes increasingly
more tedious. In practice it is seldom of interest to
go beyond the linear $\psi$-correction given by (27) and the quadratic
energy correction, which turns out to be

$$E_n^{II} = \int\bar\psi_n^0 V^{II}\psi_n^0\,dx + {\sum_i}'\,\frac{\left|\int\bar\psi_i^0 V^{I}\psi_n^0\,dx\right|^2}{E_n^0 - E_i^0}, \qquad i = 0, 1, 2, \ldots, \tag{31}$$

where the $\sum$ is primed to indicate as before that the term with
$i = n$ is to be omitted.

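To illustrate, we shall use (31) to compute $E_0^{II}$ for the case (5). Here
$V^{I} = \tfrac12\epsilon kx^2$ and $V^{II} = 0$, so that

$$E_0^{II} = \tfrac14\epsilon^2k^2\,{\sum_i}'\,\frac{\left(\int\psi_i^0\,x^2\,\psi_0^0\,dx\right)^2}{E_0^0 - E_i^0}. \tag{32}$$

The integrals in (32) all vanish except those for $i = 0$ and $i = 2$. The
term with $i = 0$ is to be omitted, and (32) reduces to

$$E_0^{II} = \tfrac14\epsilon^2k^2\,\frac{a^4/2}{E_0^0 - E_2^0} = -\tfrac1{16}\epsilon^2h\nu_c, \tag{33}$$

in agreement with (10).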
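The same bookkeeping can be carried out for any level $n$ of the oscillator. The sketch below is a hedged illustration, not part of the text: it works in units of $h\nu_c$, assumes $ka^2 = h\nu_c$, and uses the standard matrix elements of $(x/a)^2$ between normalized oscillator functions (nonzero only for $i = n$ and $i = n \pm 2$). The function names and the `basis_size` cutoff are ours.

```python
import math

def xi2_element(i, n):
    """<i| (x/a)^2 |n> for normalized oscillator functions."""
    if i == n:
        return n + 0.5
    if i == n + 2:
        return 0.5 * math.sqrt((n + 1) * (n + 2))
    if i == n - 2:
        return 0.5 * math.sqrt(n * (n - 1))
    return 0.0

def corrections(n, eps, basis_size=40):
    """Return (E^I, E^II) from (22) and (31), in units of h nu_c,
    for V^I = (1/2) eps k x^2 and V^II = 0."""
    e1 = 0.5 * eps * xi2_element(n, n)
    e2 = 0.0
    for i in range(basis_size):
        if i == n:
            continue  # the primed sum omits the term i = n
        v = 0.5 * eps * xi2_element(i, n)
        e2 += v * v / ((n + 0.5) - (i + 0.5))
    return e1, e2

if __name__ == "__main__":
    n, eps = 3, 0.2
    e1, e2 = corrections(n, eps)
    exact = (n + 0.5) * math.sqrt(1.0 + eps)
    # Unperturbed level plus the two corrections, compared with the exact level.
    print((n + 0.5) + e1 + e2, exact)
```

For this perturbation the exact levels are $(n + \tfrac12)h\nu_c(1+\epsilon)^{1/2}$, so the two corrections should reproduce the $\epsilon$ and $\epsilon^2$ terms of the binomial expansion.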

Convergence. The Rayleigh-Schroedinger method outlined
above is purely formal, and thus does not guarantee that the
series it yields are convergent. Consequently it must be used
with caution, and the results that it gives should, strictly, be
examined for convergence; rigorous convergence tests, however,
are in most cases too laborious to be feasible. Fortunately, the
following rather simple though somewhat rough rule can be relied
upon in many cases: the method yields the correct approximations
to $E_n$ if the quantity $|E_n - E_n^0|$ (the shift that the perturbation
causes in the level in question) is small compared to the quantities
$E_{n+1}^0 - E_n^0$ and $E_n^0 - E_{n-1}^0$ (the separations from its immediate
neighbors of the unperturbed level in question). This rule
is convenient because in practice it can be applied through an
examination of the experimental data, that is, prior to any
elaborate computations.
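The need for caution can be seen in the oscillator example itself, where the exact level contains the factor $(1+\epsilon)^{1/2}$ and the perturbation series is its binomial expansion. The sketch below (function name ours) simply accumulates partial sums of that expansion; it suggests that the sums settle down for $|\epsilon| < 1$ but wander off for $\epsilon > 1$, even though the exact expression remains perfectly well defined there.

```python
import math

def partial_sums(eps, terms):
    """Partial sums of the binomial series for (1 + eps)^(1/2)."""
    sums, term, total = [], 1.0, 0.0
    for j in range(terms):
        total += term
        sums.append(total)
        term *= (0.5 - j) / (j + 1) * eps  # ratio of successive binomial terms
    return sums

if __name__ == "__main__":
    for eps in (0.5, 1.5):
        s = partial_sums(eps, 41)
        print(eps, math.sqrt(1.0 + eps), s[5], s[20], s[40])
```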

The shift caused in a given level by a perturbation is said to be
small if it is small compared to the separations of this level from
its neighbors. The perturbing potential is said to be small, insofar
as a given level is concerned, if the shift that it causes in this
level is small. For a given unperturbed system, a given perturbing


once again 

is (8). Now as 1 . 0 , '■ ^ whenever « > — 1, 




potential can thus be small in the case of one level and large in the 
case of another. 

The Rayleigh-Schroedinger method is readily extended to 
three-dimensional motion, yielding formulas quite similar in 
structure to those obtained above. But, in other than one-dimen- 
sional motion, energy levels may be degenerate even in the bound 
case; if so, they must be handled by special methods, because in
case of degeneracy the simple formulas derived above acquire zero 
denominators, so that the simple results are meaningless unless 
the corresponding numerators vanish also, in which case these 
results are indeterminate. Incidentally, it turns out that the 
levels which coincide in the unperturbed system may be shifted 
by the perturbation by different amounts, and thus may coincide 
no longer in the perturbed system; in such a case the perturbation 
is said to remove the degeneracy. Examples of this effect are found 
in atomic spectra: the unperturbed levels of an atom are usually 
degenerate, but perturbations (external electric or magnetic 
fields) remove this degeneracy at least in part, with the result 
that the levels and the spectral lines become split (Stark or Zee- 
man effects). 

Exercises 

1. Compute the precise value of $E_n$ for the case (5), expand the result
in powers of $\epsilon$, assuming that $|\epsilon| < 1$, and verify (22) and (31).

2. Consider a linear harmonic oscillator perturbed by a uniform field,
$V^{I} = \epsilon kx$, so that $V = \tfrac12kx^2 + \epsilon kx$. In the Schroedinger equation for the
perturbed system, change to the new independent variable $x + \epsilon$, solve the
resulting equation precisely, and show that $E_n = E_n^0 - \tfrac12k\epsilon^2$. Then verify
(22) and (31).

3. Expand $\psi_n$ of Exercise 2 in powers of $\epsilon$ and, with the help of 36®,
express $\psi_n$ in terms of the unperturbed $\psi$'s, disregarding $\epsilon^2$, $\epsilon^3$, $\cdots$. Then
verify (27).

4. Oscillators perturbed by potentials involving powers of x higher

than the second are said to be anharmonic; their precise energy levels 
cannot be computed in a closed form, so that perturbation methods must 
be relied upon. Consider the case V = px® + ex®, and show that En — 
(n + i)hvo (156®/iV4m^®) (n® + n. + }J) • • • , Formulas of Exercise 5* 

are helpful in this work, 

5. Consider the behavior of the $V$ of Exercise 4 for large values of $|x|$,
and indicate the conditions under which the expression for $E_n$ obtained
in Exercise 4 is certain to be faulty.

6. Show that if the unperturbed motion is bound, if $V^0$ is even in $x$,
and if $V^{I}$ is odd in $x$, then $E_n^{I} = 0$ for any $n$.

7. Show that if the unperturbed motion is bound, if $V^0$ is even in $x$,
if $V^{I}$ is odd in $x$, and if the energy levels are labeled $E_1$, $E_2$, $\cdots$, then
$\psi_n^{I}$ is odd in $x$ if $n$ is odd, and even in $x$ if $n$ is even.

8. Consider the normal state of a particle in the potential box of Fig.
17; assume that the bottom of the box slants only slightly, and show by
perturbation methods that the particle is slightly more likely to be found
in the deeper than in the shallower half of the box. [Hints: place the
origin of potential energy at the center of the bottom of the box, and
consider only the algebraic signs rather than the precise values of the
nonvanishing integrals in (27).] Note that this result is not unexpected
in view of a classical result of Exercise 14^^. What feature of (27) will
allow the result to be just the opposite (recall Exercise 5®*) in the case of a
highly excited state?

9. Let $R$ be a normalized real function of $x$ involving a parameter $\epsilon$ and
admitting, in a certain range of $\epsilon$, the expansion $R = R^0 + \epsilon R^{I} + \epsilon^2R^{II} + \cdots$,
where the superscripted $R$'s are independent of $\epsilon$; show that then $R^{I}$ is
orthogonal to $R^0$. Then recall Exercise 2®* and justify (26).

10. Determine the effect of the phase and time factors of the unperturbed
$\psi$'s on the linear $\psi$-correction (27).

11. Expand (8) into a convergent series in $\epsilon$ when $\epsilon > 1$.



Chapter VI 


ONE-DIMENSIONAL PROBABILITY CURRENTS AND 
DE BROGLIE WAVES 


37. Probability Current


Let the state of a one-dimensional system be specified by a
normalized Schroedinger function $\psi(x, t)$ which, in the general
case, need not be an eigenfunction of the Hamiltonian of the
system; the distribution-in-$x$, $\bar\psi\psi$, is then in general time-dependent.
A time-dependent $\bar\psi\psi$ is illustrated in Fig. 57, where the
full line represents $\bar\psi\psi$ for one instant
of time and the dotted line for
another.

Fig. 57. A time-dependent $\bar\psi\psi$ at two instants of time.

Now, the probability of finding the
particle somewhere at the left of a point
$x$ at the instant $t$ is

$$\int_{-\infty}^{x}\bar\psi\psi\,dx, \tag{1}$$

and the time rate of change of this probability at the instant $t$ is

$$\frac{\partial}{\partial t}\int_{-\infty}^{x}\bar\psi\psi\,dx. \tag{2}$$

In view of 18^®, our $\psi$-function remains normalized for all time,
so that a decrease in the quantity (1) is accompanied by an equal
increase in the probability of finding the particle somewhere at
the right of the point $x$; similarly, an increase in (1) is accompanied
by an equal decrease in the probability of finding the
particle at the right of the point $x$. Consequently we may say
that the probability flows from left to right past the point $x$ whenever
(1) decreases with $t$, and may define the term probability
current at $x$ at the instant $t$ to mean the time rate at which the
probability flows from left to right across the point $x$ at the
instant $t$. This rate, which we denote by $S = S(x, t)$, should



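be equal to the time rate of decrease of the probability of finding
the particle at the left of the point $x$, that is,

$$S = -\frac{\partial}{\partial t}\int_{-\infty}^{x}\bar\psi\psi\,dx. \tag{3}$$

If $S$ is negative, the implication is that the probability current flows
leftward at the instant $t$.

We may differentiate the right side of (3) under the integral sign and
eliminate the time derivatives by means of the Schroedinger equation,
which in the present case means that

$$S = \frac{\hbar}{2im}\int_{-\infty}^{x}\left(\bar\psi\,\frac{\partial^2\psi}{\partial x^2} - \psi\,\frac{\partial^2\bar\psi}{\partial x^2}\right)dx. \tag{4}$$

Integration by parts now yields

$$S = \frac{\hbar}{2im}\left[\bar\psi\,\frac{\partial\psi}{\partial x} - \psi\,\frac{\partial\bar\psi}{\partial x}\right]_{-\infty}^{x}, \tag{5}$$

and, since a normalized $\psi$ vanishes at $x = -\infty$, we get

$$S = \frac{\hbar}{2im}\left(\bar\psi\,\frac{\partial\psi}{\partial x} - \psi\,\frac{\partial\bar\psi}{\partial x}\right). \tag{6}$$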


-n . 

e niustr&te the use of 

SrS? 5" " pa£ 22- w'f " Particle ^hose state is 

^ at t = 0 tad eL ^ t'^ke the averages 

»w.. , CT 


( 8 ) 


that is, '^^^ssF+ipr3®*p{ 2(zrf^p-y|- 



209 


CURRENTS AND WAVES 


§37 


We note in particular that for $t > 0$ the current has the same algebraic
sign as $x$ and hence is everywhere directed away from $x = 0$, and that the
absolute value of $S$ starts everywhere with the value zero at $t = 0$, grows
to a maximum, and eventually vanishes; these qualitative features of the
current can of course be inferred directly by inspecting the distribution-in-$x$
for the state (7), namely,




V2t(aI + AlP) 


exp 


/ 5!-.4 


( 10 ) 


An alternative method for obtaining (6) is as follows: Having
defined the probability current $S$ as the time rate at which
probability flows from left to right past the point $x$ at the instant
$t$, we proceed to consider the time rate of change of the probability
of finding the particle between the points $x$ and $x + dx$ (Fig. 58).
On the one hand [since the probability of
finding the particle at $x$ in $dx$ is $\bar\psi\psi\,dx$],
this rate is $\frac{\partial}{\partial t}(\bar\psi\psi)\,dx$; on the other hand
[since the current at $x$ is $S(x, t)$ and the
current at $x + dx$ is $S(x + dx, t) = S(x, t) + \frac{\partial S}{\partial x}\,dx + \cdots$], this rate,
correctly to the first order in $dx$, is $-\frac{\partial S}{\partial x}\,dx$; hence
$\frac{\partial}{\partial t}(\bar\psi\psi)\,dx = -\frac{\partial S}{\partial x}\,dx$, that is,

$$\frac{\partial}{\partial t}(\bar\psi\psi) = -\frac{\partial S}{\partial x}. \tag{11}$$

Fig. 58.


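Equation (11) is called the conservation equation (or continuity
equation) for probability. The steps that carry (3) into (4) carry
(11) into $\partial S/\partial x = (\hbar/2im)\left(\bar\psi\,\partial^2\psi/\partial x^2 - \psi\,\partial^2\bar\psi/\partial x^2\right)$, so that

$$\frac{\partial S}{\partial x} = \frac{\hbar}{2im}\,\frac{\partial}{\partial x}\left(\bar\psi\,\frac{\partial\psi}{\partial x} - \psi\,\frac{\partial\bar\psi}{\partial x}\right). \tag{12}$$

Integrating (12) with respect to $x$ and setting the arbitrary additive
'constant' of integration equal to zero, we once again get (6).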
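The conservation equation (11) can be tested numerically on a spreading free packet. The sketch below is an illustration under explicit assumptions: units with $\hbar = m = 1$, a Gaussian packet centred at $x = 0$ with zero average momentum and initial width parameter `SIGMA` (our name), and derivatives estimated by central differences. The packet used is the standard closed-form free Gaussian solution and is not necessarily written in the notation of the state (7).

```python
import cmath

HBAR = 1.0   # assumed units
MASS = 1.0
SIGMA = 1.0  # assumed initial width parameter

def psi(x, t):
    """Free Gaussian packet centred at x = 0 with zero average momentum."""
    z = 1.0 + 1j * HBAR * t / (2.0 * MASS * SIGMA ** 2)
    norm = (2.0 * cmath.pi * SIGMA ** 2) ** -0.25
    return norm * z ** -0.5 * cmath.exp(-x * x / (4.0 * SIGMA ** 2 * z))

def density(x, t):
    """Distribution-in-x: conj(psi) * psi."""
    return abs(psi(x, t)) ** 2

def current(x, t, dx=1e-5):
    """Right side of (6), i.e. (hbar/m) Im(conj(psi) dpsi/dx)."""
    dpsi = (psi(x + dx, t) - psi(x - dx, t)) / (2.0 * dx)
    return HBAR / MASS * (psi(x, t).conjugate() * dpsi).imag

def continuity_residual(x, t, d=1e-4):
    """d(density)/dt + dS/dx, which (11) requires to vanish."""
    drho_dt = (density(x, t + d) - density(x, t - d)) / (2.0 * d)
    ds_dx = (current(x + d, t) - current(x - d, t)) / (2.0 * d)
    return drho_dt + ds_dx

if __name__ == "__main__":
    print(continuity_residual(0.8, 0.6))           # should be close to zero
    print(current(-1.0, 1.0), current(1.0, 1.0))   # directed away from x = 0
```

The sign pattern printed in the last line is the numerical counterpart of the remark that, for $t > 0$, the current of a spreading packet is everywhere directed away from its center.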

This method avoids the use of the expression (2), which may
be meaningless if $\psi$ is not normalizable, and enables us in fact
to show (merely by substituting the terms 'relative probability'
and 'current of relative probability' for 'probability' and 'probability
current' in the paragraph just above) that whenever $\psi$ is
not normalized (or even not normalizable) the right side of (6)
represents the current of relative probability flowing past the point
$x$ at the instant $t$.


To illustrate the use of (6) when $\psi$ is not normalizable, we consider a
free particle whose momentum has certainly the numerical value $p$, so
that the Schroedinger function is

$$\psi = Ae^{i(px - Et)/\hbar}, \tag{13}$$

where $E = p^2/2m$. Substituting (13) into (6), we get

$$S = p\,|A|^2/m = v\,|A|^2; \tag{14}$$

thus, as we would expect by classical analogy, the current of relative
probability in this case equals the velocity of the particle multiplied by the
relative probability of finding the particle in an interval of unit length.

As another example, we consider a free particle in the state specified
by the Schroedinger function

( 5 “)

$$\psi = \left(Ae^{ipx/\hbar} + Be^{-ipx/\hbar}\right)e^{-iEt/\hbar}, \tag{15}$$

where $E = p^2/2m$. The energy of the particle is certainly $E$, and its
momentum is either $p$ or $-p$, the relative probabilities of the two
possibilities being $|A|^2$ and $|B|^2$, respectively. For definiteness, the
number $p$ is taken to be positive, so that the relative probability that
the particle moves from left to right is $|A|^2$, and the relative probability
that the particle moves from right to left is $|B|^2$. The result of
substituting (15) into (6) reduces to

$$S = v\,|A|^2 - v\,|B|^2, \tag{16}$$

and we recognize that $S$ consists of two distinguishable parts, the first
term arising from the possibility that the particle moves to the right,
and the second from the possibility that the particle moves to the left.
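That the cross terms cancel, leaving the current of a two-wave superposition independent of $x$, is easy to confirm numerically. The sketch below is a minimal illustration assuming units with $\hbar = m = 1$; it evaluates the right side of (6) for the space part $Ae^{ipx/\hbar} + Be^{-ipx/\hbar}$ (the time factor drops out of the current), and the function names are ours.

```python
import cmath

HBAR = 1.0  # assumed units
MASS = 1.0

def psi(x, A, B, p):
    """Space part of the superposition: A exp(ipx/hbar) + B exp(-ipx/hbar)."""
    return A * cmath.exp(1j * p * x / HBAR) + B * cmath.exp(-1j * p * x / HBAR)

def dpsi_dx(x, A, B, p):
    """Analytic x-derivative of psi."""
    k = p / HBAR
    return 1j * k * (A * cmath.exp(1j * k * x) - B * cmath.exp(-1j * k * x))

def current(x, A, B, p):
    """Right side of (6): (hbar / 2im)(conj(psi) psi' - psi conj(psi'))."""
    f, df = psi(x, A, B, p), dpsi_dx(x, A, B, p)
    return (HBAR / (2j * MASS) * (f.conjugate() * df - f * df.conjugate())).real

if __name__ == "__main__":
    A, B, p = 2 + 1j, 0.5 - 0.3j, 1.7
    expected = (p / MASS) * (abs(A) ** 2 - abs(B) ** 2)
    for x in (-3.0, 0.0, 0.4, 10.0):
        print(x, current(x, A, B, p), expected)
```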

Exercises 


*.nor p.in22»h^Mro**'* just below (6) to the case when neither 

that at < =oJc£erib!dbyla in tbe state 
3. Let ip be such that W la * ®itt cat, where pp is 20*r. 

chan^ of shape; deduce fLm f3) wS +^”® ^CO without 

.‘’f Exerote^ ^ 

dimensional system'^ Sep* energy state of a one- 



5. Show that the probability current for any energy state of a bound
one-dimensional system is zero.

6. Show that the results of Exercises 4 and 5 are just those suggested
by classical analogy.

7. Show that, if $\psi$ is normalized, then $\int S\,dx = \mathrm{av}_\psi\,v$.

8. To exhibit a curious effect of coherent quantum-mechanical super- 

position of states, compute the probability current for a free particle when 
^ with El 5^ and with fixed A and B, 

and note that the result is not that suggested by classical analogy; but 
note that the classical result can be obtained from the quantum-mechanical 
one by averaging the latter with respect to t. Show also that, if a state 
of a free particle is an incoherent superposition of two energy states of
different energies, then the current is that suggested by classical analogy.

9. Explain why neglecting the additive 'constant' in the result of
integrating (12) is permissible in evaluating the probability current.

38. One-Dimensional Potential Barriers 


A rectangular barrier. If the force field acting on a particle is
zero or nearly zero everywhere except in a limited region (as for
example in Fig. 64), it is said to comprise a potential barrier. We
shall consider first the rather
idealized barrier of Fig. 59, restricting
ourselves to the case
when

$$0 < E < V_0, \tag{1}$$

$E$ being the total energy of the
particle and $V_0$ the 'height' of the barrier.

Fig. 59. A rectangular potential barrier.

Incidentally, according to classical mechanics, the barrier of
Fig. 59 is impenetrable for a particle satisfying (1), that is, the
condition (1) precludes the possibility that a classical particle
should be found in the central region ($-\tfrac12l < x < \tfrac12l$) or should
pass from one of the outer regions into the other. For example,
if a classical particle satisfying (1) is once certainly in the left
region ($x < -\tfrac12l$), then it will remain in the left region forever;
in particular, if this particle is at some time moving toward the
barrier, it will be reflected back upon reaching the point $x = -\tfrac12l$
and will from then on move away from the barrier. Again, if it
is uncertain whether a classical particle satisfying (1) is at some
instant $t_1$ in the left or in the right region, then the probability
that the particle is in the left region at any later instant is the
same as that at the instant $t_1$; in other words, from the classical
standpoint, no probability current can flow across the barrier of
Fig. 59 under the condition (1). These classical results hold of
course whether the energy of the particle is a certainty or not, provided
it is certainly smaller than $V_0$.

Any energy greater than zero is a quantum-mechanically
possible energy value of our system (Exercise 9^), and the Schroedinger
eigenfunction of the Hamiltonian of the system belonging
to the energy value $E$ such that $0 < E < V_0$ is, apart from the
time factor,


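(28®*)

$$\psi = \begin{cases} \psi_l = Ae^{i\alpha x} + (Ac_2 + Bc_1)e^{-i\alpha x} \\ \psi_c = (Ac_4 + Bc_3)e^{\beta x} + (Ac_3 + Bc_4)e^{-\beta x} \\ \psi_r = (Ac_1 + Bc_2)e^{i\alpha x} + Be^{-i\alpha x}, \end{cases} \tag{2}$$

where the $c$'s are the constants listed in Exercise 9®*, $A$ and $B$
are arbitrary constants,

$$\alpha = \sqrt{\kappa E}, \qquad \beta = \sqrt{\kappa(V_0 - E)}, \tag{3}$$

and the subscripts $l$, $c$, and $r$ stand for left, central, and right.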

Reflection and transmission of a probability packet. To study
the quantum-mechanical behavior of a particle impinging on our
barrier from, say, the left side, we must consider a state for which
the particle is at some initial instant moving in the left region
toward the barrier, and must set up a $\psi$-function describing such
a state; since no eigenfunction of our $H$ vanishes everywhere
except in the left region, the desired $\psi$-function will have to be a
superposition of the eigenfunctions of $H$. Further, if we wish to
consider cases when the energy of the particle is certainly less
than $V_0$, we must restrict ourselves to $\psi$-functions that are superpositions
of only those eigenfunctions of $H$ which belong to $E$'s
lying below $V_0$. The latter limitation turns out to make it impossible
to construct a $\psi$-function that vanishes everywhere except
in the left region; in other words, the claim that the energy of the
particle is certainly less than $V_0$ is quantum-mechanically incompatible
with the claim that the particle is certainly in the left
region. But it is possible to construct states for which the energy
of the particle is certainly less than $V_0$ and for which, at $t = 0$,
the particle is as likely to be in the left region as we please; a
state of this type was studied by MacColl.*


* L. A. MacColl, Phys. Rev., 40, 621 (1932). The author is indebted

o r. MacColl for advice concerning Fig. 60 and the accompanying text. 



At $t = 0$, the distribution-in-$x$, $\bar\psi\psi$, for MacColl's state is a
packet whose bulk is located in the left region and whose center
is moving toward the barrier. The subsequent behavior of this
packet, indicated in Fig. 60 without regard for the finer details,
turns out to be as follows: At first the packet moves toward the
barrier with uniform velocity and spreads at the same time in
much the same way as does a packet in the field-free case; as the
bulk of this incident packet approaches the barrier, a transmitted
packet emerges from the far side of the barrier, while a reflected
packet forms at the near side and recedes leftward; and eventually
the incident packet becomes replaced by two packets: the transmitted
one, moving uniformly rightward (and spreading) in the
right-hand region, and the reflected one, moving uniformly leftward
(and spreading) in the left-hand region. The formation of
the transmitted packet implies, of course, that the probability of
finding the particle in the right-hand region has grown from a
negligible value to a sizable one.

Fig. 60. Successive graphs of MacColl's packet colliding with the rectangular
barrier shown at the top. The vertical scale in the diagram of the
barrier is of course unrelated to that in the five graphs of packets.




MacColl's $\psi$-function is a superposition of only those eigenfunctions
of $H$ which belong to $E$'s smaller than $V_0$, so that the
possibility that the energy of the particle is greater than $V_0$ is
excluded. Hence the results described above lead to the physical
conclusion that a particle impinging on our barrier with energy
certainly smaller than $V_0$ will not necessarily be reflected by the
barrier, but may pass across the barrier and continue its forward
motion.

If a particle whose energy is certainly smaller than the height
of a barrier should pass across the barrier, it is said to have gone
through rather than over the barrier. This possibility of going
through potential barriers, called the tunnel effect, makes it
possible to understand in terms of quantum mechanics a number
of atomic phenomena that are inexplicable classically.

The probability that a particle in a given state should go through
or over a given barrier is called the transmission coefficient for this
state and is denoted by $T$; the number

$$R = 1 - T, \tag{4}$$

that is, the probability that the particle should be reflected by the
barrier, is called the reflection coefficient. For a particle in a state
of the type discussed by MacColl, $T$ is the ratio of the final area
of the transmitted packet to that of the incident packet, and $R$ is
the ratio of the final area of the reflected packet to that of the
incident packet.

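Reflection and transmission of probability currents. The discussion
above was concerned with a particle whose position at
some instant is reasonably well known, but whose energy is
accordingly uncertain; the effect of a barrier on a particle whose
energy is a certainty can be investigated, by means of probability
currents, as follows:

If a particle whose energy is certainly $E$, where $0 < E < V_0$,
moves in the potential of Fig. 59, we have, according to $6^{37}$ and (2),

$$S = \begin{cases} S_l = \hbar\left(\bar\psi_l\,\partial\psi_l/\partial x - \psi_l\,\partial\bar\psi_l/\partial x\right)/2im \\ S_c = \hbar\left(\bar\psi_c\,\partial\psi_c/\partial x - \psi_c\,\partial\bar\psi_c/\partial x\right)/2im \\ S_r = \hbar\left(\bar\psi_r\,\partial\psi_r/\partial x - \psi_r\,\partial\bar\psi_r/\partial x\right)/2im, \end{cases} \tag{5}$$

or, more explicitly,

$$S = \begin{cases} S_l = v\,|A|^2 - v\,|Ac_2 + Bc_1|^2 \\ S_c \\ S_r = v\,|Ac_1 + Bc_2|^2 - v\,|B|^2, \end{cases} \tag{6}$$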



where $v = \sqrt{2E/m}$ is the speed that the particle would certainly
have were it not for the barrier, and where $S_c =
\hbar\beta\left[(\bar A\bar c_3 + \bar B\bar c_4)(Ac_4 + Bc_3) - (\bar A\bar c_4 + \bar B\bar c_3)(Ac_3 + Bc_4)\right]/im$. We
note that, in general, $S_c \ne 0$, so that, as we already know from
the results of MacColl, our barrier transmits probability current
even when $E < V_0$.

Now, according to the result of Exercise $4^{37}$, $S_l$, $S_c$, and $S_r$
are numerically equal to each other; yet useful information is
revealed by the details of the structure of (6). Thus we recognize,
for example, that the net current $S_l$ flowing in the left region
consists of a rightward component $v\,|A|^2$ and a leftward component
$-v\,|Ac_2 + Bc_1|^2$, while in the right region the net current
consists of the rightward component $v\,|Ac_1 + Bc_2|^2$ and the
leftward component $-v\,|B|^2$. The component currents flowing
toward the barrier, that is, $v\,|A|^2$ in $S_l$ and $-v\,|B|^2$ in $S_r$, are
both quite arbitrary and independent of the barrier, so that they
can be interpreted as incident currents, originating respectively
at $-\infty$ and at $\infty$. To simplify the problem of interpreting the
component currents flowing from the barrier, we consider
states for which current is incident from only one side, say the
left, and accordingly let $B = 0$, so that (6) becomes

$$S = \begin{cases} S_l = v\,|A|^2 - v\,|Ac_2|^2 \\ S_c \\ S_r = v\,|Ac_1|^2. \end{cases} \tag{7}$$

Having already noted that probability can flow across our barrier
even when $E < V_0$, we now interpret $S_r$ in (7) as the transmitted
part of the incident current $v\,|A|^2$, and the term $-v\,|Ac_2|^2$ in
$S_l$ as the reflected part. It then follows that the ratios of the
reflected and transmitted currents to the incident current, that is,
the reflection and transmission coefficients of the barrier for the
current, are

$$R_E = \frac{v\,|Ac_2|^2}{v\,|A|^2} = |c_2|^2 \tag{8}$$

and

$$T_E = \frac{v\,|Ac_1|^2}{v\,|A|^2} = |c_1|^2, \tag{9}$$

where the subscript $E$ emphasizes the fact that the energy of our
particle is certainly $E$. Using the value of $c_1$ given in Exercise
9®, we get the following explicit formula for $T_E$ for the case
$0 < E < V_0$:

$$T_E = \left[1 + \frac{V_0^2\,\sinh^2\left(l\sqrt{\kappa(V_0 - E)}\right)}{4E(V_0 - E)}\right]^{-1}. \tag{10}$$

Graphs of (10) for three particular values of the quantity $ml^2V_0$
are shown in Fig. 61. Note that, in contradistinction to classical
mechanics, our barrier transmits a part of the incident probability
current for every $E$ lying between 0 and $V_0$, that the constants
describing the shape of the barrier appear in $T_E$ in the combination
$l^2V_0$, and that, for a barrier of a given shape, $T_E$ is quite
sensitive to the value of $m$.
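Formula (10) is simple enough to tabulate directly. The sketch below is a hedged illustration: it assumes $\kappa = 2m/\hbar^2$ and units with $\hbar = 1$, and the function name and default arguments are ours.

```python
import math

HBAR = 1.0  # assumed units

def transmission(E, V0, l, m=1.0):
    """Formula (10) for 0 < E < V0, with kappa = 2m / hbar^2 (assumed)."""
    kappa = 2.0 * m / HBAR ** 2
    beta = math.sqrt(kappa * (V0 - E))
    return 1.0 / (1.0 + V0 ** 2 * math.sinh(l * beta) ** 2 / (4.0 * E * (V0 - E)))

if __name__ == "__main__":
    # Same barrier shape (V0 = 1, l = 1), increasing mass: T falls off rapidly.
    for m in (0.5, 1.0, 2.0, 4.0):
        print(m, transmission(0.5, 1.0, 1.0, m))
```

Running the loop with the barrier shape held fixed shows how quickly the transmission drops as $m$ grows, which is the sensitivity noted above.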

Fig. 61. Graphs of the transmission coefficient (10).

The current (7) is independent of $x$ (Exercise $4^{37}$), so that
$v\,|A|^2 - v\,|Ac_2|^2 = v\,|Ac_1|^2$; thus $1 - |c_2|^2 = |c_1|^2$, and hence (8) and (9) satisfy
the equation $R_E + T_E = 1$, that is, Equation (4). The explicit
formulas for $R_E$ and $T_E$ can therefore be checked by means of (4).
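The relation $R_E + T_E = 1$ can also be confirmed without the closed-form constants, by solving the joining conditions at the two edges of the barrier numerically. The sketch below is illustrative only: it assumes $A = 1$, $B = 0$, $\kappa = 2$ (that is, $\hbar = m = 1$), continuity of $\psi$ and of $d\psi/dx$ at $x = \pm\tfrac12l$, and it identifies the reflected and transmitted amplitudes with $c_2$ and $c_1$; the small linear solver and all names are ours.

```python
import cmath
import math

def solve(matrix, rhs):
    """Gaussian elimination with partial pivoting for a small complex system."""
    n = len(rhs)
    a = [row[:] + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n + 1):
                a[r][c] -= factor * a[col][c]
    x = [0j] * n
    for r in range(n - 1, -1, -1):
        s = a[r][n] - sum(a[r][c] * x[c] for c in range(r + 1, n))
        x[r] = s / a[r][r]
    return x

def barrier_amplitudes(E, V0, l, kappa=2.0):
    """Reflected and transmitted amplitudes for unit incident amplitude, 0 < E < V0."""
    alpha = math.sqrt(kappa * E)
    beta = math.sqrt(kappa * (V0 - E))
    x1, x2 = -0.5 * l, 0.5 * l
    ein, eout = cmath.exp(1j * alpha * x1), cmath.exp(-1j * alpha * x1)
    et = cmath.exp(1j * alpha * x2)
    # Unknowns: reflected amplitude, the two amplitudes inside the barrier, transmitted amplitude.
    m = [
        [eout, -math.exp(beta * x1), -math.exp(-beta * x1), 0.0],
        [-1j * alpha * eout, -beta * math.exp(beta * x1), beta * math.exp(-beta * x1), 0.0],
        [0.0, math.exp(beta * x2), math.exp(-beta * x2), -et],
        [0.0, beta * math.exp(beta * x2), -beta * math.exp(-beta * x2), -1j * alpha * et],
    ]
    rhs = [-ein, -1j * alpha * ein, 0.0, 0.0]
    r, _, _, t = solve(m, rhs)
    return r, t

if __name__ == "__main__":
    r, t = barrier_amplitudes(0.5, 1.0, 1.0)
    print(abs(r) ** 2 + abs(t) ** 2)                      # R_E + T_E
    print(abs(t) ** 2, 1.0 / (1.0 + math.sinh(1.0) ** 2))  # compare with (10)
```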

The probability current is the measure of the probability that
the particle should pass across a given point per unit time, and
hence $R_E$ and $T_E$ of a barrier for a current associated with a state
of energy $E$ can be said to be the reflection and transmission
coefficients of the barrier for a particle whose energy is certainly
$E$.

The current $S_c$ flowing in the barrier region does not admit of a
simple resolution into rightward and leftward components, and
indicates, so to speak, that, if the particle is reflected by the
barrier, then the reflection may have taken place at any point
of the barrier. But, as we have seen, no detailed consideration
of $S_c$ is necessary for the computation of $R$ and $T$.

The rectangular barrier will be considered further in the
Exercises.

A single-step barrier. If the energy $E$ of a particle impinging
on a barrier is greater than the height $V_0$ of the barrier, then,
classically, the particle is certain
to go over the barrier and to continue
its forward motion; but, according
to quantum mechanics,
the particle may be reflected even
if $E > V_0$. To illustrate this
point, we shall consider a particle
of energy $E$ moving in the potential field of Fig. 62 and subject
to the condition

$$E > V_0. \tag{11}$$

Fig. 62. A single-step barrier.

The $\psi$-function belonging to the energy value $E$ satisfying (11)
is, in this case, apart from the time factor,

$$\psi = \begin{cases} \psi_l = Ae^{i\alpha_lx} + \left(\dfrac{\alpha_l - \alpha_r}{\alpha_l + \alpha_r}A + \dfrac{2\alpha_r}{\alpha_l + \alpha_r}B\right)e^{-i\alpha_lx}, & x < 0 \\[2ex] \psi_r = \left(\dfrac{2\alpha_l}{\alpha_l + \alpha_r}A + \dfrac{\alpha_r - \alpha_l}{\alpha_l + \alpha_r}B\right)e^{i\alpha_rx} + Be^{-i\alpha_rx}, & x > 0, \end{cases} \tag{12}$$

where

$$\alpha_l = \sqrt{\kappa E}, \qquad \alpha_r = \sqrt{\kappa(E - V_0)}, \tag{13}$$

and where the subscripts $l$ and $r$ mean left and right. The reader
should verify that $\psi_l$ and $\psi_r$ satisfy the respective Schroedinger
equations for the left and for the right regions, and that the various
coefficients in (12) have been adjusted so that $\psi_l$ and $\psi_r$ join
properly at $x = 0$.

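The probability current for the state (12) is

$$S = \begin{cases} S_l = v_l\,|A|^2 - v_l\left|\dfrac{(\alpha_l - \alpha_r)A + 2\alpha_rB}{\alpha_l + \alpha_r}\right|^2 \\[2ex] S_r = v_r\left|\dfrac{2\alpha_lA + (\alpha_r - \alpha_l)B}{\alpha_l + \alpha_r}\right|^2 - v_r\,|B|^2, \end{cases} \tag{14}$$

where $v_l = \sqrt{2E/m}$ and $v_r = \sqrt{2(E - V_0)/m}$ are the respective
speeds that our particle would certainly have classically in the
two regions. Inspection of (14) shows that the general state (12)
involves two incident currents: the current $v_l\,|A|^2$ coming
rightward from $-\infty$, and the current $-v_r\,|B|^2$ coming leftward
from $\infty$. To get the special case in which current is incident
only from the left, we set $B = 0$ and reduce (14) to

$$S = \begin{cases} S_l = v_l\,|A|^2 - v_l\left(\dfrac{\alpha_l - \alpha_r}{\alpha_l + \alpha_r}\right)^2|A|^2 \\[2ex] S_r = v_r\left(\dfrac{2\alpha_l}{\alpha_l + \alpha_r}\right)^2|A|^2. \end{cases} \tag{15}$$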



Identifying the first term in $S_l$ with the incident current, the
second term in $S_l$ with the reflected current, and the single term
in $S_r$ with the transmitted current, we now get

$$R_E = \left(\frac{\alpha_l - \alpha_r}{\alpha_l + \alpha_r}\right)^2 = \left(\frac{1 - \sqrt{1 - V_0/E}}{1 + \sqrt{1 - V_0/E}}\right)^2, \tag{16}$$

$$T_E = \frac{v_r}{v_l}\left(\frac{2\alpha_l}{\alpha_l + \alpha_r}\right)^2 = \frac{4\sqrt{1 - V_0/E}}{\left(1 + \sqrt{1 - V_0/E}\right)^2}. \tag{17}$$

$T_E$ is plotted against $V_0/E$ in Fig. 63; the transmission is thus
imperfect unless $E$ is much greater than $V_0$.
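The step coefficients depend on $E$ and $V_0$ only through the ratio $V_0/E$, which makes them easy to tabulate. The sketch below is a minimal illustration of the closed forms $R_E = [(1-s)/(1+s)]^2$ and $T_E = 4s/(1+s)^2$ with $s = \sqrt{1 - V_0/E} = \alpha_r/\alpha_l$; the function name is ours.

```python
import math

def step_coefficients(E, V0):
    """Reflection and transmission coefficients of the single step for E > V0."""
    s = math.sqrt(1.0 - V0 / E)   # equals alpha_r / alpha_l
    R = ((1.0 - s) / (1.0 + s)) ** 2
    T = 4.0 * s / (1.0 + s) ** 2
    return R, T

if __name__ == "__main__":
    for ratio in (0.1, 0.5, 0.75, 0.99):
        R, T = step_coefficients(1.0, ratio)
        print(ratio, R, T, R + T)
```

The last column should equal unity throughout, and $T_E$ approaches unity only as $V_0/E$ becomes small.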

Fig. 63. The transmission coefficient (17).

The expression (16) is the reflection coefficient of the single-step
barrier of Fig. 62 for a probability current coming from the left.
Now, what is the reflection coefficient of this barrier for a current
coming from the right? We might think offhand that in the latter
case the transmission should be perfect, since the force at $x = 0$
would help the particle to move from right to left, rather than
hinder it; a barrier, however, is always





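equally reflecting on both sides, so to speak (Exercise 8). To
verify this for the present case, we set $A = 0$, reduce (14) to

$$
S_l = -v_l\left(\frac{2\alpha_r}{\alpha_l + \alpha_r}\right)^2 |B|^2, \qquad
S_r = v_r\left(\frac{\alpha_r - \alpha_l}{\alpha_l + \alpha_r}\right)^2 |B|^2 - v_r|B|^2,
\tag{18}
$$

and (identifying the second term in $S_r$ with the incident current
coming from $\infty$, the first term in $S_r$ with the reflected current,
and the single term in $S_l$ with the transmitted current) get

$$
R_S = \left(\frac{\alpha_r - \alpha_l}{\alpha_l + \alpha_r}\right)^2
    = \left(\frac{\alpha_l - \alpha_r}{\alpha_l + \alpha_r}\right)^2
\tag{19}
$$

as before.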
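The equality of the two reflection coefficients, and the equality of the net currents on the two sides of the step, can be confirmed numerically. In the sketch below we assume the same form of the state as in the text, with amplitudes $A$ (rightward, left region) and $B$ (leftward, right region), and we take the speeds proportional to the $\alpha$'s, in units chosen so that $v = \alpha$; the function names and sample values are hypothetical.

```python
import math

def amplitudes(A, B, alpha_l, alpha_r):
    """Outgoing amplitudes fixed by the matching at x = 0."""
    s = alpha_l + alpha_r
    out_left = ((alpha_l - alpha_r) * A + 2 * alpha_r * B) / s    # leftward wave, left region
    out_right = (2 * alpha_l * A + (alpha_r - alpha_l) * B) / s   # rightward wave, right region
    return out_left, out_right

def currents(A, B, alpha_l, alpha_r):
    """Net currents S_l and S_r, in units where the speed equals alpha."""
    out_left, out_right = amplitudes(A, B, alpha_l, alpha_r)
    S_l = alpha_l * (abs(A) ** 2 - abs(out_left) ** 2)
    S_r = alpha_r * (abs(out_right) ** 2 - abs(B) ** 2)
    return S_l, S_r

def reflection_from_left(alpha_l, alpha_r):
    out_left, _ = amplitudes(1.0, 0.0, alpha_l, alpha_r)    # B = 0
    return abs(out_left) ** 2

def reflection_from_right(alpha_l, alpha_r):
    _, out_right = amplitudes(0.0, 1.0, alpha_l, alpha_r)   # A = 0
    return abs(out_right) ** 2

alpha_l, alpha_r = math.sqrt(2.0), math.sqrt(0.8)
print(reflection_from_left(alpha_l, alpha_r), reflection_from_right(alpha_l, alpha_r))
print(currents(1.0 + 0.5j, -0.3 + 0.2j, alpha_l, alpha_r))
```

The two printed reflection coefficients agree, and the two net currents are equal for any $A$ and $B$, as they must be for a stationary state.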

The single-step barrier will be considered further in the 
Exercises. 

One-dimensional barriers in general. The barriers of Figs. 59
and 62, made up of 'step-potentials,' admit of precise mathematical
treatment in terms of the elementary functions; but their
shapes are quite artificial from the physical standpoint. Barriers
of greater physical interest are more difficult to study theoretically
because of the mathematical complexities of solving the appropriate
Schroedinger equation, and approximate methods must
usually be employed. On the assumption that the requisite
Schroedinger $\psi$'s are available, we shall now outline the general
procedure for the computation of $R_S$ and $T_S$; it is essentially the
same as that used above, and consequently we need discuss it but
briefly.

We consider a barrier of the type shown in Fig. 64, for which
$V(x)$ approaches a constant $V_l$ when $x \to -\infty$ and a constant
$V_r$ when $x \to \infty$; and we restrict ourselves to the case when
$E > V_l$ and $E > V_r$. For definiteness, we assume that the peak
of the barrier lies near $x = 0$, and then specify the (infinite)
region in which $V$ is essentially equal to $V_l$ by writing $x \ll -1$,
and the (infinite) region in which $V$ is essentially equal to $V_r$ by
writing $x \gg 1$.






For $x \ll -1$, the general solution of the Schroedinger equation
then has the form

$$A e^{i\alpha_l x} + B e^{-i\alpha_l x}, \qquad \alpha_l = \sqrt{\kappa(E - V_l)}; \tag{20}$$

for $x \gg 1$ the general solution has the form

$$C e^{i\alpha_r x} + D e^{-i\alpha_r x}, \qquad \alpha_r = \sqrt{\kappa(E - V_r)}; \tag{21}$$

for values of $x$ for which $V(x)$ is markedly dependent on $x$, that is,
in the barrier region, the explicit form of $\psi$ depends of course
on the details of the shape of the barrier. The general solution
of the Schroedinger equation for the entire region $(-\infty, \infty)$
involves two arbitrary constants; we may take these to be $C$ and
$D$ in (21), in which case $A$ and $B$ in (20) are no longer arbitrary.
We now set $D = 0$ and get a $\psi$ of the form

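$$
\psi = \begin{cases}
A' e^{i\alpha_l x} + B' e^{-i\alpha_l x}, & x \ll -1 \\
C e^{i\alpha_r x}, & x \gg 1,
\end{cases}
\tag{22}
$$

where $A$ and $B$ are primed to indicate that they are not arbitrary.
The current for the state (22) has the form

$$
S = \begin{cases}
v_l|A'|^2 - v_l|B'|^2, & x \ll -1 \\
v_r|C|^2, & x \gg 1,
\end{cases}
\tag{23}
$$

which is essentially the form (15). The incident current, coming
from $-\infty$, is given by the term $v_l|A'|^2$, and the reflected and the
transmitted currents are given by the terms $-v_l|B'|^2$ and
$v_r|C|^2$, respectively; hence

$$
R_S = \frac{|B'|^2}{|A'|^2}, \qquad
T_S = \frac{v_r|C|^2}{v_l|A'|^2} = \frac{\alpha_r|C|^2}{\alpha_l|A'|^2}.
\tag{24}
$$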


The principal mathematical problem in evaluating $R_S$ and $T_S$
is thus the computation of a $\psi$-function of the form (22), that is,
of a solution of the Schroedinger equation which for large values
of $x$ behaves like $e^{i\alpha_r x}$. A systematic method for computing
approximate $\psi$'s of this form was devised by Born.¹
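When no analytical solution is at hand, a $\psi$ of the required form can be found by direct numerical integration. The following sketch is not Born's method; it is a plain fourth-order Runge-Kutta integration of $\psi'' = -\kappa(E - V)\psi$, started far to the right with the purely transmitted wave $e^{i\alpha_r x}$ (so that $C = 1$) and carried leftward, after which the incident and reflected amplitudes are read off. The barrier shape $1.5/\cosh^2 x$, the units with $\kappa = 1$, and all names are assumptions made for the illustration.

```python
import cmath
import math

def barrier_coefficients(V, E, kappa=1.0, x_max=12.0, steps=6000):
    """Estimate the reflection and transmission coefficients for a barrier V(x)
    that is essentially constant for |x| >= x_max (requires E above both constants)."""
    alpha_l = math.sqrt(kappa * (E - V(-x_max)))
    alpha_r = math.sqrt(kappa * (E - V(x_max)))

    def deriv(x, psi, dpsi):
        # First-order system equivalent to psi'' = -kappa (E - V) psi.
        return dpsi, -kappa * (E - V(x)) * psi

    # Start on the right with the transmitted wave alone, C = 1.
    x = x_max
    psi = cmath.exp(1j * alpha_r * x)
    dpsi = 1j * alpha_r * psi
    h = -2.0 * x_max / steps            # negative step: integrate leftward
    for _ in range(steps):
        k1p, k1d = deriv(x, psi, dpsi)
        k2p, k2d = deriv(x + h / 2, psi + h / 2 * k1p, dpsi + h / 2 * k1d)
        k3p, k3d = deriv(x + h / 2, psi + h / 2 * k2p, dpsi + h / 2 * k2d)
        k4p, k4d = deriv(x + h, psi + h * k3p, dpsi + h * k3d)
        psi += h / 6 * (k1p + 2 * k2p + 2 * k3p + k4p)
        dpsi += h / 6 * (k1d + 2 * k2d + 2 * k3d + k4d)
        x += h

    # Decompose psi on the left into A' e^{i alpha_l x} + B' e^{-i alpha_l x}.
    a_in = 0.5 * (psi + dpsi / (1j * alpha_l)) * cmath.exp(-1j * alpha_l * x)
    b_out = 0.5 * (psi - dpsi / (1j * alpha_l)) * cmath.exp(1j * alpha_l * x)
    reflection = abs(b_out) ** 2 / abs(a_in) ** 2
    transmission = alpha_r / (alpha_l * abs(a_in) ** 2)   # |C| = 1
    return reflection, transmission

def smooth_barrier(x):
    # Hypothetical smooth barrier of height 1.5 that vanishes for large |x|.
    return 1.5 / math.cosh(x) ** 2

for energy in (1.0, 2.0, 4.0):
    r, t = barrier_coefficients(smooth_barrier, energy)
    print(f"E = {energy:3.1f}   R = {r:.6f}   T = {t:.6f}   R + T = {r + t:.6f}")
```

The sum of the two coefficients serves as a check on the integration; it should equal unity to within the accuracy of the step size.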


1 For references and examples, see N. H. Frank and
L. A. Young, Phys. Rev., 38, 80 (1931).





The application of the W. B. K. $\psi$-functions of §34 to the
barrier of Fig. 64, under the condition that $E$ is less than the
height of the barrier, yields the approximate formula

$$
T_S = f(E)\exp\left[-\frac{4\pi}{h}\int_{x_1}^{x_2}\sqrt{2m(V - E)}\,dx\right];
\tag{25}
$$

here $x_1$ and $x_2$ are the points so marked in Fig. 64, and $f(E)$ is a
function of $E$ depending on the shape of the barrier but in general
varying with $E$ more slowly than the exponential in (25); in rough
work, $f(E)$ may be set equal to unity.
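The exponential factor in the W. B. K. formula is easily evaluated by numerical quadrature once the turning points are known. The sketch below sets $f(E) = 1$ and works in units in which $(4\pi/h)\sqrt{2m(V - E)}$ becomes $2\sqrt{\kappa(V - E)}$ with $\kappa = 1$; the parabolic barrier, the midpoint rule, and the function names are assumptions chosen for the illustration. For the parabolic barrier $V = V_0(1 - x^2/w^2)$ the integral can be done in closed form, which provides a check.

```python
import math

def wbk_transmission(V, E, x1, x2, kappa=1.0, n=20000):
    """exp(-2 * integral of sqrt(kappa (V - E)) dx from x1 to x2), with f(E) = 1.

    x1 and x2 are the classical turning points, where V(x) = E.
    The integral is evaluated by the midpoint rule with n subintervals.
    """
    h = (x2 - x1) / n
    total = 0.0
    for i in range(n):
        x = x1 + (i + 0.5) * h
        total += math.sqrt(max(kappa * (V(x) - E), 0.0)) * h
    return math.exp(-2.0 * total)

# Hypothetical parabolic barrier of height V0 and half-width w.
V0, w = 3.0, 2.0

def parabolic_barrier(x):
    return V0 * (1.0 - (x / w) ** 2)

E = 1.0
turning = w * math.sqrt(1.0 - E / V0)
numeric = wbk_transmission(parabolic_barrier, E, -turning, turning)
closed_form = math.exp(-math.pi * w * math.sqrt(V0) * (1.0 - E / V0))
print(numeric, closed_form)
```

For a rectangular barrier of width $l$ the same routine gives simply $e^{-2l\sqrt{\kappa(V_0 - E)}}$.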

Some potential barriers of interest in the theory of electron 
emission from metals are followed by a constant force field (a 
linearly varying potential), rather than by a field-free space (con- 
stant potential) as in Fig. 64. In this case, special methods must 
be used to recognize the terms representing, respectively, the 
rightward and the leftward component currents in the expression 
for the net current flowing in the constant field far from the 
barrier.^ 


Exercises 


1. A 1-volt electron impinges head-on on a rectangular barrier $1 \times 10^{-8}$
cm wide and 3 volts high; use Fig. 61 to show that the probability of
transmission is approximately .5.

2. Take Ci and cj from Exercise 9“ and verify in detail that the sum 
of (8) and (9) is unity. 

3. Show that, if $E > V_0$, then the transmission coefficient of the
rectangular barrier of Fig. 59 is

$$
T_S = \left[1 + \frac{V_0^2 \sin^2\left(l\sqrt{\kappa(E - V_0)}\right)}{4E(E - V_0)}\right]^{-1}.
\tag{26}
$$

The fluctuations of $T_S$ with $E$ are described by saying that for $E > V_0$
the barrier is a selective transmitter.

4. Show that, if $E < V_0$, then $R_S$ of the single-step barrier of Fig. 62
is unity, that is, reflection is certain, although the particle may
temporarily penetrate into the barrier region.

5. Set $B = A(\alpha_r - \alpha_l)/2\alpha_r$ in (14), and point out a curious effect of
coherent superposition of states that then takes place.

6. Show that $R_S$ and $T_S$ in (24) satisfy (4).

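7. Show that, if $\psi_1$ and $\psi_2$ are solutions of a Schroedinger equation
belonging to the same value of $E$, then the quantities

$$
\psi_1\frac{d\psi_2}{dx} - \psi_2\frac{d\psi_1}{dx}
\qquad\text{and}\qquad
\psi_1^*\frac{d\psi_2}{dx} - \psi_2\frac{d\psi_1^*}{dx}
$$

are both independent of $x$.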




8. Consider the state

$$
\psi = \begin{cases}
B e^{-i\alpha_l x}, & x \ll -1 \\
C' e^{i\alpha_r x} + D' e^{-i\alpha_r x}, & x \gg 1,
\end{cases}
\tag{27}
$$

rather than the state (22), and show that when a current is incident on
the barrier of Fig. 64 from the right we have $R_S = |C'|^2/|D'|^2$ and
$T_S = \alpha_l|B|^2/\alpha_r|D'|^2$; then use the results of Exercise 7 to show
that these coefficients are identical with those in (24).

9. Show that, for large values of the quantity $l\sqrt{\kappa(V_0 - E)}$, Equation
(10) reduces to the form (25).

10. Consider the state (22), and compute the average value of $|\psi|^2$ for
$x \ll -1$, and the value of $|\psi|^2$ for $x \gg 1$; then, using (24), show that the
results can be interpreted as follows: the particle may be in the region
$x \ll -1$ because it is on its way toward the barrier or because it has been
reflected; and it may be in the region $x \gg 1$ because it has been transmitted.

11. Use the transformation formula 58" to compute the distribution-
in-$p$ for the state (12) with $B = 0$. What possible values of $p$ turn out
to be the only ones of interest, and what are their probabilities?

12. $S_l$ in (15) can be written as

$$
S_l = \left[v_l|A|^2 + k\right] - \left[v_l\left(\frac{\alpha_l - \alpha_r}{\alpha_l + \alpha_r}\right)^2|A|^2 + k\right],
\tag{28}
$$

where $k$ is an arbitrary constant. Use the results of Exercise 11 to show
that, if $k \neq 0$, it would be improper to interpret the first term in (28) as the
incident and the second term as the reflected current for the state (12)
with $B = 0$.

13. Show in a qualitative way that the transmitted packet in Fig. 60
should move somewhat faster than the incident MacColl packet, while
the reflected packet should move somewhat slower than the incident packet.


39. Virtual Binding 


Let us consider next a potential consisting of two symmetrical
potential barriers of height $V_0$ and approaching the value $V_a$ for
large values of $|x|$, as in Fig. 65. The classical motion may in this
case fall under one of three categories: if $0 < E < V_a$, the particle
can move only in a part of the potential valley between the
barriers; if $V_a < E < V_0$, the particle can be either in the infinite
region at the left of the left barrier, in a part of the valley, or in
the infinite region at the right of the right barrier, but it cannot
move from one of these three regions into another; if $E > V_0$ the
complete infinite region is accessible to the particle. In particular,
if a classical particle whose energy is less than $V_0$ should once be
in the valley, it will remain bound in the valley forever.

Fig. 65. A two-barrier potential.

Even a rough consideration of the Schroedinger $\psi$'s belonging
to the potential of Fig. 65 is sufficient to show that in quantum
mechanics the classification of the possible motions is quite
different. The energy spectrum of the system consists of a discrete
part lying between 0 and $V_a$, and a continuous part lying
above $V_a$. The $\psi$-functions belonging to the discrete spectrum
vanish when $|x| \to \infty$ and differ markedly from zero only between
the barriers, so that, if the particle is in an energy state such that
$E < V_a$, then it is forever most likely to be found within the
valley, and we may say that it is bound there. On the other hand,
the $\psi$-functions belonging to the continuous spectrum are oscillatory
outside the barriers, so that a particle whose energy is
greater than $V_a$ may be found practically anywhere in the complete
infinite region. Thus the quantum-mechanical condition
for the confinement of the motion to a portion of the valley is
$E < V_a$, rather than the less strict classical condition $E < V_0$,
and the confinement is not quite perfect even if $E < V_a$.

Now, for certain $E$'s lying between $V_a$ and $V_0$, there arises in
quantum mechanics a phenomenon which is akin to the binding of the
particle in the valley and which we call virtual binding. To illustrate:
we consider the idealized two-barrier potential of Fig. 66, let $E$ lie
between 0 and $V_0$, and note that the Schroedinger equation then has a
particular solution which is even in $x$ and which for positive values
of $x$ has the form

$$
\psi = \begin{cases}
\cos\alpha x, & 0 < x < a \\
A e^{\beta x} + B e^{-\beta x}, & a < x < b \\
C\cos\alpha(x + D), & b < x,
\end{cases}
\tag{1}
$$

where $\alpha = \sqrt{\kappa E}$ and $\beta = \sqrt{\kappa(V_0 - E)}$. The conditions on the
constants $A$, $B$, $C$, and $D$ for the matching of the $\psi$'s and the
$\psi'$'s at $x = a$ and at $x = b$ are

$$A = \tfrac12\left(\cos\alpha a - \frac{\alpha}{\beta}\sin\alpha a\right)e^{-\beta a} \tag{2}$$

$$B = \tfrac12\left(\cos\alpha a + \frac{\alpha}{\beta}\sin\alpha a\right)e^{\beta a} \tag{3}$$

$$C\cos\alpha(b + D) = A e^{\beta b} + B e^{-\beta b} \tag{4}$$

$$C\sin\alpha(b + D) = -\frac{\beta}{\alpha}A e^{\beta b} + \frac{\beta}{\alpha}B e^{-\beta b}. \tag{5}$$

Fig. 66. An idealized two-barrier potential.

Without working out the details of the general case, we now
take the width $l$ ($= b - a$) of each barrier sufficiently large to
make $e^{-\beta l}$ negligible compared to $e^{\beta l}$, so that $B e^{-\beta b}$ is negligible
compared to $A e^{\beta b}$. Disregarding for the moment the exceptional
case when the factor in parentheses in (2) is zero or very small,
we then reduce (4) and (5) to

$$C\cos\alpha(b + D) = A e^{\beta b} \tag{6}$$

$$C\sin\alpha(b + D) = -\frac{\beta}{\alpha}A e^{\beta b}. \tag{7}$$


Squaring and adding (6) and (7), we then find, with the help
of (2), that

$$
C^2 = \frac14\left(1 + \frac{\beta^2}{\alpha^2}\right)\left(\cos\alpha a - \frac{\alpha}{\beta}\sin\alpha a\right)^2 e^{2\beta l}.
\tag{8}
$$

Thus, if $l$ is sufficiently large, then $C^2$ is in general large compared
to unity, so that the amplitude of (1) in the outer regions is large
compared to that in the valley, and the particle is less likely to
be in the valley than in any other region of length $2a$. A Schroedinger
function having a relatively large amplitude outside the barriers
is shown in Fig. 67, where the barriers are indicated by the
thick parts of the $x$-axis.

Fig. 67. A Schroedinger function for the system of Fig. 66.

The exceptional cases arise when the factor in parentheses in (2)
is zero or very small, that is, when $E$ satisfies or nearly satisfies
the equation

$$\tan\alpha a = \beta/\alpha. \tag{9}$$


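Equations (2) to (5) then reduce to

$$A = 0, \qquad B = e^{\beta a}\cos\alpha a \tag{10}$$

$$C\cos\alpha(b + D) = e^{-\beta l}\cos\alpha a \tag{11}$$

$$C\sin\alpha(b + D) = e^{-\beta l}\sin\alpha a, \tag{12}$$

and we get

$$C^2 = e^{-2\beta l}. \tag{13}$$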

Hence, in the exceptional cases, $C^2$ is smaller than unity, (1) has
the form indicated in Fig. 68, and the particle is more likely to be
in the valley than in any other region of length $2a$. The values of $E$
for which the Schroedinger equation for an unbound system possesses
solutions having relatively large amplitudes in a limited region are
called virtual energy levels of the system. It will be shown in Exercise
2 that the virtual levels of our present system are grouped about the
discrete true levels of a rectangular well of width $2a$ and depth $V_0$.
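The sharp contrast between the general case and the exceptional case is easily exhibited numerically. The sketch below assumes the even solution described above: $\cos\alpha x$ in the valley, $A e^{\beta x} + B e^{-\beta x}$ under the barrier, and $C\cos\alpha(x + D)$ outside, with $\alpha = \sqrt{\kappa E}$ and $\beta = \sqrt{\kappa(V_0 - E)}$. It computes $C^2$ from the exact matching conditions at $x = a$ and $x = b$, and locates the lowest root of $\tan\alpha a = \beta/\alpha$ by bisection. The units ($\kappa = 1$), the dimensions of the potential, and the function names are arbitrary choices for the example.

```python
import math

def outer_amplitude_sq(E, V0, a, b, kappa=1.0):
    """C^2 for the even solution, from exact matching at x = a and x = b (0 < E < V0)."""
    alpha = math.sqrt(kappa * E)
    beta = math.sqrt(kappa * (V0 - E))
    ca, sa = math.cos(alpha * a), math.sin(alpha * a)
    A = 0.5 * (ca - (alpha / beta) * sa) * math.exp(-beta * a)
    B = 0.5 * (ca + (alpha / beta) * sa) * math.exp(beta * a)
    c_cos = A * math.exp(beta * b) + B * math.exp(-beta * b)                    # C cos alpha(b + D)
    c_sin = (beta / alpha) * (-A * math.exp(beta * b) + B * math.exp(-beta * b))  # C sin alpha(b + D)
    return c_cos ** 2 + c_sin ** 2

def lowest_virtual_level(V0, a, kappa=1.0):
    """Lowest root of tan(alpha a) = beta/alpha, found by bisection."""
    def g(E):
        alpha = math.sqrt(kappa * E)
        beta = math.sqrt(kappa * (V0 - E))
        return alpha * math.sin(alpha * a) - beta * math.cos(alpha * a)
    # g < 0 near E = 0 and g > 0 at the upper end of this bracket.
    lo, hi = 1e-12, min(V0, (math.pi / (2.0 * a)) ** 2 / kappa)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if g(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

V0, a, b = 10.0, 1.0, 2.0          # barrier width l = b - a = 1
E_virtual = lowest_virtual_level(V0, a)
beta_l = math.sqrt(V0 - E_virtual) * (b - a)
print("virtual level E =", E_virtual)
print("C^2 at the level   :", outer_amplitude_sq(E_virtual, V0, a, b))
print("exp(-2 beta l)     :", math.exp(-2.0 * beta_l))
print("C^2 at E = 0.7     :", outer_amplitude_sq(0.7, V0, a, b))
```

At the virtual level $C^2$ drops to $e^{-2\beta l}$, while at an energy chosen away from the level it is large compared to unity.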

Fig. 68. A Schroedinger function for the system of Fig. 66,
illustrating virtual binding.

We shall next estimate roughly the mean time, $t_m$, which our
particle may be expected to spend within the valley if its state
is specified by the $\psi$-function (1). Using $|\psi|^2$ for the relative
distribution-in-$x$, we obtain for the relative probability of finding
the particle within the valley the value $\int_{-a}^{a}\cos^2\alpha x\,dx$, that is,
approximately $a$. The corresponding relative probability of finding
it in an interval of length $2a$ lying outside the barrier is
$\int_{x}^{x+2a} C^2\cos^2\alpha(x + D)\,dx$, that is, approximately $aC^2$. Further,
when the particle is far outside the barriers, it may be imagined
to behave as though it were a free particle of energy $E$ and speed
$\sqrt{2E/m}$, and hence the time that it may be expected to spend in an
interval of length $2a$ lying outside the barriers is $2a\sqrt{m/2E}$. The
proportion

$$
\frac{t_m}{2a\sqrt{m/2E}} = \frac{a}{aC^2}
\tag{14}
$$





now yields the approximate result

$$t_m = \frac{2a}{C^2}\sqrt{\frac{m}{2E}}. \tag{15}$$

According to (8), $t_m$ is, in general, of the order of magnitude of

$$2a\sqrt{\frac{m}{2E}}\,e^{-2\beta l}, \tag{16}$$

while for a virtual energy level we get, using (13), the much
greater mean time

$$t_m = 2a\sqrt{\frac{m}{2E}}\,e^{2\beta l}. \tag{17}$$

In the case of the barrier of Fig. 65, the situation is similar.
Thus, if the W. B. K. $\psi$-functions of §34 are used in an approximate
treatment of the problem, we find that, when $V_a < E < V_0$,
the square of the amplitude of a $\psi$-function outside the barrier is,
in general, larger than that in the valley by a factor of the order
of magnitude of

$$\exp\left[\frac{4\pi}{h}\int_{x_1}^{x_2}\sqrt{2m(V - E)}\,dx\right], \tag{18}$$

where $x_1$ and $x_2$ are the points so marked in Fig. 65; this result
corresponds to (8). But, for special values of $E$, there exist
$\psi$-functions for which the ratio of the square of the amplitude
outside to that inside is approximately

$$\exp\left[-\frac{4\pi}{h}\int_{x_1}^{x_2}\sqrt{2m(V - E)}\,dx\right]; \tag{19}$$

this result corresponds to (13). These special values of $E$, that
is, the virtual energy levels, turn out to lie near the discrete levels
of the potential well that we get if, in Fig. 65, we replace the two
parts of the curve, lying outside the maxima, by horizontal lines
through these maxima. For the case of a virtual level, the mean
stay inside the valley comes out to be approximately

$$t_m = L\sqrt{\frac{m}{2E}}\exp\left[\frac{4\pi}{h}\int_{x_1}^{x_2}\sqrt{2m(V - E)}\,dx\right], \tag{20}$$

where $L$ is the width of the valley at the height $E$.

To summarize: When $V_a < E < V_0$, the $\psi$-functions for
two-barrier potentials of the form shown in Fig. 65 have, as a rule,
large amplitudes outside the barriers, and the mean stay of the
particle in the valley is usually very short. But, for special
values of $E$, there exist $\psi$'s having small amplitudes outside the
barriers, compared to the amplitudes inside; for the corresponding
states, the mean stay of the particle in the valley is long, and the
possibility that the particle is in the valley and will remain
virtually bound there for an appreciable time may be sufficiently
probable to be of physical interest.

Radioactive emission of $\alpha$-particles. When extended to three
dimensions, the considerations outlined above permit a theoretical
interpretation of the emission of $\alpha$-particles by radioactive
atomic nuclei. To summarize the essentials of the problem, we
quote, with a few minor changes, a passage from Rasetti:³

We shall first discuss the serious difficulties that are encountered when
we try to explain the phenomenon of $\alpha$-disintegration from the standpoint
of the classical theory. These difficulties appear in striking form if we
compare the spontaneous emission of $\alpha$-particles by a nucleus with the
scattering of $\alpha$-particles by the same nucleus. Let us consider, for example,
the uranium nucleus.

Rutherford's scattering experiments show that even the fastest $\alpha$-particles
available (those of Th C', whose energy is $14 \cdot 10^{-6}$ ergs) are unable,
even in a head-on collision, to penetrate close enough to the nucleus to
show departures from the Coulomb law. This observation means that,
at least up to a distance of $3 \cdot 10^{-12}$ cm. from the center of the nucleus,
where the potential energy $V(r)$ of the $\alpha$-particle is $14 \cdot 10^{-6}$ ergs, this
potential energy is still expressed by the Coulomb formula:

$$V(r) = 2Ze^2/r,$$

where $Ze$ is the charge of the nucleus, and $2e$ and $r$ are the charge of the
$\alpha$-particle and its distance from the center of the nucleus. At smaller
distances, where the $\alpha$-particle cannot penetrate, we shall certainly find
deviations from the Coulomb potential, since the nearly stable binding
of $\alpha$-particles in the nucleus requires the existence of a potential hole in
the center of the nucleus.

The general shape of the function $V(r)$ must therefore be the one indicated
in Fig. 69, where the dotted line represents the Coulomb potential,
and the solid line the actual potential. The inner part of the curve has
been traced arbitrarily; but for $r > 3 \cdot 10^{-12}$ cm., the scattering experiments
show that there is no appreciable departure from the Coulomb potential.

The uranium nucleus spontaneously emits particles whose energy is
$6.6 \cdot 10^{-6}$ ergs. (This energy is indicated in the diagram in Fig. 69.) It is
consequently difficult to understand how the particles contained in the
inside of the nucleus can go over a potential barrier which is at least twice

3 F. Rasetti, Elements of Nuclear Physics, page 100. New York:
Prentice-Hall, Inc., 1936.





as high as their total energy. According to the classical theory, particles 
of this energy could originate only from a point at a distance of 

cm. from the center of the 
nucleus, where the Coulomb 
potential energy has the value 
of $6.6 \cdot 10^{-6}$ ergs. However, in this region there is no possibility
for the stable binding of an $\alpha$-particle. In other words, we can say
that, in the classical model, an $\alpha$-particle emitted by a nucleus should
have a kinetic energy corresponding at least to the top of the potential
barrier. For example, in uranium the energy should be higher than
$14 \cdot 10^{-6}$ ergs.

This difficulty disappears when we treat the problem from the standpoint
of quantum mechanics, as has been done independently by Gurney and
Condon⁴ and by Gamow.⁵

Indeed, according to Fig. 69, the potential on any line passing
through the center of the nucleus has essentially the two-barrier
form of Fig. 65, so that quantum-mechanically we may suspect,
in view of the experimental results, that the situation is as follows:
there is a virtual level of $6.6 \cdot 10^{-6}$ ergs; the not-yet-emitted
$\alpha$-particle occupies this level, being virtually bound inside the
nucleus; the $\psi$-function describing this level does not, however,
vanish outside the nucleus, and consequently the particle may
eventually get out; and when this happens a spontaneous emission
of the $\alpha$-particle is said to have taken place.

When put to the test, this hypothesis fits in well with the 
experimental data concerning a-activity, for it leads to the expo- 
nential law of decay and the Geiger-Nuttall relation between mean 
life and range, yields the virtual levels at approximately the right 
places, and, through the exponential factor in (20), accounts for 
the fact that the mean lives of certain distinct nuclei differ from 
each other tremendously in spite of the evidence for the essential 
similarity of the respective potential barriers. An a-particle, 
however, is acted on by several moving nuclear constituents, the 

4 R. W. Gurney and E. U. Condon, Nature, 122, 439 (1928); Phys. Rev.,
33, 127 (1929).

5 G. Gamow, Zeits. f. Physik, 51, 204 (1928).






valley portion of Fig. 69 can at best be but a time average, and 
hence even elementary considerations indicate a limit to the pre- 
cision that we may expect of this simple theory. 

The general form of the law of decay can be indicated by the
following rough argument based on an interplay of classical and
quantum-mechanical ideas. We return to Fig. 65, denote the
energy of one of the virtual levels by $E$, denote by $R$ and $T$ the
reflection and transmission coefficients of either one of the two
barriers for a particle of energy $E$, and assume that $T$ is so small
that the higher-order terms in the expansion
$\log(1 - T) = -T - \tfrac12 T^2 - \cdots$ can be disregarded and that hence

$$\log R = \log(1 - T) = -T. \tag{21}$$

Next we imagine that a particle (of energy $E$) which is in the
valley at the instant $t = 0$, oscillates classically back and forth
between the barriers, but at each collision with a barrier it has a
chance $T$ of being transmitted out. The probability $P(t)$ that
it is still within the valley at the instant $t$ is then $R$ after one
collision, $R^2$ after two collisions, and so on, that is,

$$P(t) = R^n, \tag{22}$$

where $n$ is the expected number of collisions between the instants
0 and $t$; in view of (21) we then have

$$P(t) = R^n = e^{n\log R} = e^{-nT}. \tag{23}$$

To estimate $n$, we turn again to the classical theory, take the
average speed of the particle to be approximately $\sqrt{2E/m}$, and
find that

$$n = vt/L = \sqrt{2E/m}\;t/L, \tag{24}$$

where $L$ is the width of the valley for a particle of energy $E$.
Eliminating $n$ from (23) by means of (24), and $T$ by means of
$25^{38}$, with $f(E) = 1$, we finally get

$$P(t) = e^{-t/\tau}, \tag{25}$$

where

$$\tau = L\sqrt{\frac{m}{2E}}\exp\left[\frac{4\pi}{h}\int_{x_1}^{x_2}\sqrt{2m(V - E)}\,dx\right], \tag{26}$$

$x_1$ and $x_2$ being the points so marked in Fig. 65. Thus $P(t)$
decays exponentially with time, in agreement with experiment.
Note that $\tau$ turns out to be identical with the $t_m$ given by (20).
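The quality of the approximation $\log(1 - T) \approx -T$ that underlies the exponential law can be judged from a short computation. In the sketch below the survival probability $R^n = (1 - T)^n$, with $n = \sqrt{2E/m}\,t/L$, is compared with $e^{-t/\tau}$, where $\tau = L\sqrt{m/2E}/T$; the transmission coefficient is simply given a small assumed value instead of being computed from a barrier, and the units ($E = m = L = 1$) and function names are arbitrary.

```python
import math

def collisions(t, E, m, L):
    """Expected number of collisions with a barrier up to the time t (classical estimate)."""
    return math.sqrt(2.0 * E / m) * t / L

def survival_product(t, T, E, m, L):
    """Probability of still being in the valley, as the product R^n with R = 1 - T."""
    return (1.0 - T) ** collisions(t, E, m, L)

def mean_life(T, E, m, L):
    """tau = L sqrt(m / 2E) / T."""
    return L * math.sqrt(m / (2.0 * E)) / T

def survival_exponential(t, T, E, m, L):
    return math.exp(-t / mean_life(T, E, m, L))

T, E, m, L = 1.0e-6, 1.0, 1.0, 1.0     # assumed small transmission, arbitrary units
tau = mean_life(T, E, m, L)
for multiple in (0.1, 1.0, 5.0):
    t = multiple * tau
    print(multiple, survival_product(t, T, E, m, L), survival_exponential(t, T, E, m, L))
```

For small $T$ the two columns agree to within a relative error of order $T$ per mean life, so that the decay is exponential for all practical purposes.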


Exercises 


1. Show that the Schroedinger equation for the potential of Fig. 66
has a solution that is odd in $x$ and that for positive values of $x$ has the form

$$
\psi = \begin{cases}
\sin\alpha x, & 0 < x < a \\
A' e^{\beta x} + B' e^{-\beta x}, & a < x < b \\
C'\sin\alpha(x + D'), & b < x,
\end{cases}
\tag{27}
$$

and that the system possesses virtual levels lying at and near the $E$'s
satisfying the equation

$$\tan\alpha a = -\alpha/\beta. \tag{28}$$

2. Show that (9) and (28) can be combined into the single formula 19**.

3. Consider the curvature of the $\psi$'s and show graphically why virtual
levels may be expected to exist when $V(x)$ is as in Fig. 65.


40. Waves 

The assumptions with which we began our discussion of quantum
mechanics were formulated sometime after Schroedinger
proposed his theory, and our approach to, say, the Schroedinger
equation is quite different from that used originally by Schroedinger
himself. The stimulus for Schroedinger's researches that
resulted in his celebrated equation came from the work of de
Broglie,⁶ who proposed the remarkable idea that a kind of vibration
is associated with a moving particle, just as (so interference
experiments suggest) a light wave is associated with a moving
photon. Schroedinger's theory in its original form (his wave
mechanics) made constructive uses of the notion of a wave motion
associated with a particle motion, even though no entirely satisfactory
physical interpretation of the 'associated waves' was
available at the time.⁷ In fact, the extraordinary rapidity with
which Schroedinger's researches progressed was at least in part
due to the similarity that some of the mathematical problems
involved bore to certain problems in classical wave motion.

We should then expect that, whatever the manner in which we 
may introduce the Schroedinger method of studying the motion 


6 L. de Broglie, Ann. de Physique, 3, 22 (1925); Thèses, Paris, 1924.

7 The interpretation of the $\psi$'s as probability amplitudes was proposed by
M. Born, Zeits. f. Physik, 37, 863 (1926).





of a particle, the mathematical apparatus employed in the applica- 
tions of the method should resemble that used in the classical 
theory of vibrations of continuous media, and that some of the 
terms used in the classical theory of wave motion should find a 
place in the quantum theory of particle motion. The remainder 
of this chapter is intended to indicate the extent to which this is 
so; we begin by reviewing certain terms. 

Terminology. If the quantity $u = u(x, t)$ depends on the
coordinate $x$ and the time $t$, for all values of $x$ and $t$, through the
relation

$$u = A\cos 2\pi(kx - \nu t + \delta), \tag{1}$$

where $A$, $k$, and $\delta$ are real constants and $\nu$ is a positive constant,⁸
we call $u$ a real cosinusoidal (or sinusoidal) wave of infinite
extent in $x$ and in $t$, or simply a real monochromatic wave; the term
monochromatic implies both the sinusoidal shape of the wave and
its infinite extent in $x$ and $t$. The usual names and symbols for the
various quantities involved in (1) are listed in Table 10.

Table 10

    $|A|$                              amplitude of $u$
    $A^2$                              intensity of $u$
    $k$                                propagation constant of $u$
    $|k|$                              wave number of $u$
    $|1/k| = \lambda$                  wave length of $u$
    $\nu$                              frequency of $u$
    $\nu^{-1} = T$                     period of $u$
    $2\pi(kx - \nu t + \delta)$        phase of $u$
    $2\pi\delta$                       initial phase of $u$

Waves of type (1) arise, for example, in connection with small 
plane transverse vibrations of a uniform elastic string of infinite 
length whose equilibrium position coincides with the a:-axis; and u 
may then represent, for every value of x and tj the transverse 
displacement of the string from its equilibrium position. In our 
discussion we shall not, however, restrict ourselves to any par- 
ticular physical interpretation of u. 

For a given type of a physical wave, the frequency $\nu$ and the
wave number $|k|$ are usually dependent on each other in a
definite way; for example, if (1) represents the transverse
displacement of a uniform elastic string, we have

$$\nu = |k|\sqrt{\tau/\rho}, \tag{2}$$

where $\rho$ is the linear density and $\tau$ is the tension of the string.


8 This restriction, imposed here for simplicity, is sometimes inadvisable. 





Once $A$ in (1) is fixed, the numerical value of $u$ at a given point $x$
at a given instant $t$ is determined by the phase

$$\phi = 2\pi(kx - \nu t + \delta); \tag{3}$$

hence, as $t$ increases, any preassigned value of $u$ drifts, so to speak,
along the $x$-axis with the velocity $\nu/k$, which, in view of the
dependence of $\nu$ on $k$, can be regarded as a function of $k$ alone.
We denote this velocity by $w_\phi(k)$ or by $w_\phi$, so that

$$w_\phi = w_\phi(k) = \nu/k, \tag{4}$$

and call it the phase velocity of $u$ (rather than simply the velocity
of $u$) to emphasize that it is the velocity with which the values of
the phase of $u$ are, in the course of time, transferred along the
$x$-axis. The direction of the phase velocity is determined by the
algebraic sign of the propagation constant $k$.

If

$$u = \sum_s A_s\cos 2\pi(k_s x - \nu_s t + \delta_s), \qquad s = 1, 2, 3, \cdots, \tag{5}$$

we call $u$ a composite wave,⁹ and say that it is a discrete superposition
of the monochromatic component waves $A_1\cos 2\pi(k_1 x - \nu_1 t + \delta_1)$,
$A_2\cos 2\pi(k_2 x - \nu_2 t + \delta_2)$, and so forth.

A more general type of a composite wave is a continuous
superposition of monochromatic waves, that is,

$$u = \int_{-\infty}^{\infty} A(k)\cos 2\pi[kx - \nu(k)t + \delta(k)]\,dk. \tag{6}$$

The integral in (6) stands in more or less the same relation to the
sum in (5) as does a Fourier integral to a Fourier series.

A monochromatic wave preserves its sinusoidal shape as time
goes on, and consequently the concept of the velocity of such a
wave is a simple one. This is in general not so in the case of a
composite wave, because the respective speeds of the component
waves may be different (when we call the propagation dispersive)
or, even if these speeds are the same, the directions of the motions
of the component waves may be different, so that the composite
wave as a whole may not preserve its shape; in fact, it is not
usually instructive to interpret any single number associated with

9 Note that we regard a wave such as $\cos 2\pi(kx - \nu t) + \cos 2\pi(-kx - \nu t)$
as composite, even though it involves only a single frequency.





a composite wave as its velocity, especially when the propagation
is dispersive. An important exception arises, however, when the
composite wave is a superposition of monochromatic waves of
nearly equal amplitudes and propagation constants. To illustrate,
we let $u$ be the superposition of the two monochromatic
waves

$$u_1 = A\cos 2\pi(kx - \nu t) \tag{7}$$

and

$$u_2 = A\cos 2\pi[(k + \Delta k)x - (\nu + \Delta\nu)t] \tag{8}$$

whose amplitudes are the same, whose propagation constants
differ by an amount $\Delta k$ small compared to $k$, whose frequencies
differ by an amount $\Delta\nu$ small compared to $\nu$, and whose initial
phases are both zero. Then

$$u = u_1 + u_2 = 2A\cos\pi[(2k + \Delta k)x - (2\nu + \Delta\nu)t]\cdot\cos\pi(\Delta k\cdot x - \Delta\nu\cdot t), \tag{9}$$

or, approximately,

$$u = 2A\cos 2\pi(kx - \nu t)\cdot\cos\pi(\Delta k\cdot x - \Delta\nu\cdot t). \tag{10}$$

Our composite wave is thus approximately a monochromatic
wave modulated by the factor $\cos\pi(\Delta k\cdot x - \Delta\nu\cdot t)$. The component
waves (7) and (8) and their superposition (10) are shown
for a fixed instant of time in Fig. 70. The composite wave
Fig. 70. Superposition of two sinusoidal waves.

consists at any instant of groups of ripples, the arrangement of
these groups (or space beats) being controlled by the modulating
factor. Now, as time goes on, the shape of $u$ changes in detail;
but the arrangement into groups persists for all time, and the
velocity with which the groups move is determined by the velocity
with which the phase of the modulating factor drifts along the
$x$-axis. Inspection of (10) shows that this velocity, called the
group velocity of our composite wave and denoted by $w_g(k)$ or
by $w_g$, is $\Delta\nu/\Delta k$, so that, if $\Delta k$ and $\Delta\nu$ are allowed to approach
zero, we have

$$w_g = w_g(k) = \frac{d\nu}{dk}. \tag{11}$$
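The trigonometric identity behind the beats, and the distinction between the phase velocity $\nu/k$ and the group velocity $d\nu/dk$, can both be checked numerically. In the sketch below the dispersion law $\nu = 3k^2$ is purely hypothetical, chosen only because it makes the two velocities differ; the function names and sample values are likewise illustrative.

```python
import math

def two_wave_sum(x, t, A, k, nu, dk, dnu):
    """Direct sum of two monochromatic waves with slightly different k and nu."""
    u1 = A * math.cos(2 * math.pi * (k * x - nu * t))
    u2 = A * math.cos(2 * math.pi * ((k + dk) * x - (nu + dnu) * t))
    return u1 + u2

def two_wave_product(x, t, A, k, nu, dk, dnu):
    """The same wave written as a carrier times a modulating factor."""
    carrier = math.cos(math.pi * ((2 * k + dk) * x - (2 * nu + dnu) * t))
    modulation = math.cos(math.pi * (dk * x - dnu * t))
    return 2 * A * carrier * modulation

def phase_velocity(nu_of_k, k):
    return nu_of_k(k) / k

def group_velocity(nu_of_k, k, dk=1e-6):
    """Central-difference estimate of d(nu)/dk."""
    return (nu_of_k(k + dk) - nu_of_k(k - dk)) / (2 * dk)

def dispersion(k):
    # Hypothetical dispersive law, nu = 3 k^2.
    return 3.0 * k * k

k, dk = 2.0, 0.05
nu, dnu = dispersion(k), dispersion(k + dk) - dispersion(k)
print(two_wave_sum(0.3, 0.7, 1.0, k, nu, dk, dnu),
      two_wave_product(0.3, 0.7, 1.0, k, nu, dk, dnu))
print("phase velocity:", phase_velocity(dispersion, k))
print("group velocity:", group_velocity(dispersion, k))
```

For this particular law the group velocity is twice the phase velocity; for a nondispersive law, $\nu$ proportional to $|k|$, the two coincide.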


Although (11) was derived above for a special case of no particular
interest, it represents in fact the velocity of wave groups
whenever these groups are formed by a superposition of monochromatic
waves whose amplitudes are nearly equal and whose
propagation constants all lie near a single value $k$. To give
another example, we consider a composite wave $u$ whose component
waves interfere with each other in such a way as nearly to
cancel everywhere except in a limited region, so that $u$ consists
essentially of a single wave group, that is, as we often say, a single
wave packet. A wave of this type is

$$u = \int_{k_0 - \Delta k}^{k_0 + \Delta k} A(k)\cos 2\pi[kx - \nu(k)t]\,dk, \tag{12}$$

provided that within the region of integration $\nu(k)$ can be expressed
sufficiently accurately by the first two terms of its Taylor series
in $k$, so that

$$\nu(k) = \nu(k_0) + (k - k_0)\left(\frac{d\nu}{dk}\right)_0, \tag{13}$$

and provided that within this region $A(k)$ remains essentially
constant. To verify that (12) then represents a single wave
packet, we write it as

$$u = A(k_0)\int_{k_0 - \Delta k}^{k_0 + \Delta k}\cos 2\pi\left\{kx - \left[\nu(k_0) + (k - k_0)\left(\frac{d\nu}{dk}\right)_0\right]t\right\}dk \tag{14}$$

and, integrating and omitting the zero subscripts in the result, get

$$u = 2A(k)\,\Delta k\cdot\cos 2\pi[kx - \nu(k)t]\cdot\frac{\sin 2\pi\Delta k\left[x - \left(\dfrac{d\nu}{dk}\right)t\right]}{2\pi\Delta k\left[x - \left(\dfrac{d\nu}{dk}\right)t\right]}, \tag{15}$$

that is, a modulated monochromatic wave whose modulating
factor is such that $u$ does represent a single wave packet.
The shape of (15) for a single instant is indicated




in Fig. 71. The group velocity of our wave is the velocity with
which the maximum of the modulating factor drifts along the
$x$-axis, and inspection of (15) shows that this velocity is given
by (11).
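The closed form of the packet can be compared with a direct numerical evaluation of the superposition integral. The sketch below assumes a constant amplitude and a strictly linear $\nu(k)$ over the range $k_0 \pm \Delta k$, so that the integral equals $2A\,\Delta k\,\cos 2\pi(k_0 x - \nu_0 t)\,\sin\xi/\xi$ with $\xi = 2\pi\Delta k\,(x - \nu' t)$, where $\nu'$ stands for $d\nu/dk$; the midpoint rule, the parameter values, and the function names are choices made for this example only.

```python
import math

def packet_numeric(x, t, k0, dk, nu0, dnu_dk, amp=1.0, n=4000):
    """Midpoint-rule value of the integral of amp*cos 2pi[kx - nu(k)t] over k0-dk..k0+dk,
    with nu(k) = nu0 + (k - k0) * dnu_dk."""
    h = 2.0 * dk / n
    total = 0.0
    for i in range(n):
        k = k0 - dk + (i + 0.5) * h
        nu = nu0 + (k - k0) * dnu_dk
        total += amp * math.cos(2 * math.pi * (k * x - nu * t)) * h
    return total

def packet_closed_form(x, t, k0, dk, nu0, dnu_dk, amp=1.0):
    """Carrier wave times the modulating factor sin(xi)/xi."""
    xi = 2 * math.pi * dk * (x - dnu_dk * t)
    modulation = 1.0 if abs(xi) < 1e-12 else math.sin(xi) / xi
    return 2 * amp * dk * math.cos(2 * math.pi * (k0 * x - nu0 * t)) * modulation

k0, dk, nu0, dnu_dk = 5.0, 0.1, 2.0, 0.8      # arbitrary illustrative values
for x, t in ((0.0, 0.0), (1.3, 0.5), (-4.0, 2.0), (1.6, 2.0)):
    print(x, t, packet_numeric(x, t, k0, dk, nu0, dnu_dk),
          packet_closed_form(x, t, k0, dk, nu0, dnu_dk))
```

The modulating factor has its maximum where $x = \nu' t$; the last sample point, $x = 1.6$ at $t = 2$, lies on this maximum, which moves with the group velocity.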


Fig. 71. An instantaneous graph of the packet (15).


Sometimes, for short, we speak of Wg(k) as ‘the group velocity 
of a monochromatic wave having the propagation constant k,’ 
this term being an abbreviation for ‘the velocity of wave groups 
formed by a superposition of monochromatic waves whose propa- 
gation constants all lie near the value k.’ 

In some calculations involving real waves, it is convenient to 
use complex exponentials rather than trigonometric functions 
and, using the symbolism of 2®, to write (1) as 

$$u = \operatorname{Re} A e^{2\pi i(kx - \nu t + \delta)}. \tag{16}$$

Complex exponentials can be used in a similar way in dealing 
with real composite waves. 


Exercises 

1. Use (2) to show that the propagation of the transverse displacements 
of a uniform elastic string is nondispersive. 

2. Show that, in order for a wave propagation to be nondispersive, it is 
necessary and sufficient that the group speed equal the phase speed. 

3. Show in a qualitative way that the wave packet (12) spreads out as it
moves unless the right side of (13) expresses $\nu(k)$ exactly.


41. De Broglie Waves 

The Schroedinger function specifying the state of a free particle 
moving in one dimension with a definite momentum is 

$$\psi = A e^{i(px - Et)/\hbar}, \tag{1}$$

where $A$ is an arbitrary nonvanishing constant (in general complex)
whose value does not affect the physical properties of the
state in question, and where $p$ and $E$ are the respective definite
numerical values of the momentum and the energy of the particle.
Now, if we write (1) as

$$\psi = A e^{2\pi i[(p/h)x - (E/h)t]}, \tag{2}$$

we note that $\psi$ resembles the expression

$$(16^{40}) \qquad u = \operatorname{Re} A e^{2\pi i(kx - \nu t)} \tag{3}$$

for a real monochromatic wave to such an extent as to suggest that
we introduce the notion of a complex* monochromatic wave,
interpret our $\psi$ as such a wave, and ascribe to $\psi$ an amplitude $A$,
an intensity $|A|^2$, a propagation constant $k$, a wave length $\lambda$, a
frequency $\nu$, and so forth, by correlating the various constants in
(2) with those in (3). The correlation yields, in particular,

$$k = p/h, \tag{4}$$

$$\nu = E/h. \tag{5}$$

When the Schroedinger function (1) is regarded as a complex
wave, it is called a monochromatic de Broglie wave, and the quantities
$|h/p|$, $|E/h|$, and so on, are then called the de Broglie
wave length of the particle, the de Broglie frequency of the particle,
and so on.

Since any Schroedinger function can be expanded at any instant 
in terms of the eigenfunctions of momentum, we may regard any 
Schroedinger function as a superposition of monochromatic de 
Broglie waves and may speak of it as a wave function or simply a
wave. A Kennard packet, for example, may be referred to as a
Kennard wave packet.

Although complex waves cannot be readily visualized, we may
define their velocities by adopting the definitions of the phase
velocity and the group velocity given in §40 (the latter being $11^{40}$):

$$w_\phi = \nu/k, \qquad w_g = d\nu/dk. \tag{6}$$

In view of (4) and (5) we then get

$$w_\phi = E/p, \qquad w_g = dE/dp \tag{7}$$

for the phase and the group velocity of a de Broglie wave associated
with a free particle whose momentum is certainly $p$. Recalling
that, for a free particle, $E = p^2/2m$ and $p = mv$, we get,
more explicitly,

$$w_\phi = \tfrac{1}{2}v, \qquad w_g = v \tag{8}$$

from (7), and†

$$k = mv/h, \qquad \lambda = h/mv \tag{9a, b}$$

from (4). Thus, in particular, the group velocity of a de Broglie
wave associated with a free particle of definite momentum equals
the velocity of the particle.‡

* Our terms complex and composite should not be confused.

† To conform to the notation of §40 we should, strictly, write (9b) as
$\lambda = h/m|v|$ and (5) as $\nu = |E|/h$.

‡ In the relativistic de Broglie theory, $w_g = v$ and $w_\phi w_g = c^2$, so that
$w_\phi$ is not equal to $\tfrac{1}{2}v$, but is greater than the speed of light $c$.

By introducing such terms as de Broglie wave length, frequency,
and so on, we have provided new names for certain quantities
occurring in quantum-mechanical formulas, but have not in any
way modified or augmented our fundamental postulates; whether
we do or do not refer to Schroedinger functions as waves, we must
go through exactly the same mathematical procedure when we
wish to determine the physical implications of the assertion that
the state of a system is specified by a certain $\psi$-function. The
principal reason for regarding Schroedinger functions as waves
and for speaking of the wave length and the frequency of a particle
is that the mathematical properties of complex waves are in
many respects similar to those of real waves, and that consequently
the wave terminology may be helpful, in a subjective way, to one
familiar with the properties of the real waves encountered in the
classical theory of continuous media.

The classical notion of the entity called a particle is inextricably
bound up with the possibility of ascribing to this entity at any
instant a definite position and, simultaneously, a definite momentum;
in view of the Heisenberg inequalities, then, the quantum-mechanical
notion of a particle is not precisely the same as the
corresponding classical notion. The fact that we now succeed
in associating, in a reasonably straightforward way, such terms as
wave length and frequency with certain quantities involved in
the quantum theory of the entity that we have been calling (and,
for lack of a more suitable and generally accepted term, shall
continue to call) a particle suggests that under suitable circumstances
this entity may display properties which are usually
ascribed to waves. This is indeed true; and it is for the very
reason that quantum mechanics allows for the coexistence of
particulate and undulatory properties in a single entity that it is
successful in correlating experiments (for example, Compton's)
which bring to the fore the corpuscular nature of the electron with
experiments (for example, Davisson and Germer's) which reveal
its wave properties.

In conclusion, we must emphasize the fact that the de Broglie
relation $\lambda = h/mv$ was inferred above from the state (1), which is a
momentum state of a free particle and which consequently implies
that the position of the particle is entirely indeterminate. Therefore,
to use a rather crude illustration, it is quite wrong to point
at a billiard ball of mass $m$ moving on a table with velocity $v$ and
to say, "With this ball is associated a monochromatic de Broglie
wave of wave length $h/mv$." The very fact that we are pointing
at the ball means that we have a considerable amount of information
regarding its position, and consequently the state of the ball
is not a momentum state and is not described by a $\psi$-function of
the type (1). The true situation is somewhat as follows: we
know roughly both the velocity and the position of the ball, so
that the distribution-in-$x$ and the distribution-in-$p$ both have the
form of packets; the $\psi$-function representing the state of the ball
is consequently a wave packet consisting of a superposition of an
infinite number of de Broglie waves of different wave lengths, the
component wave of wave length $h/mv$ being particularly prominent;
each of the component waves is of infinite extent, but these
components interfere with each other so as practically to cancel
everywhere except in the immediate vicinity of the most likely
position of the ball; and, as time goes on, the $x$-packet moves with
the group velocity $w_g = v$ of the composite wave, that is, with the
velocity of the ball.
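
The relations (7) to (9) of this section can be illustrated numerically. The sketch below is only an illustration under stated assumptions: the particle is nonrelativistic, the physical constants are the SI defining or CODATA values, and the function names are hypothetical.

```python
import math

H = 6.62607015e-34        # Planck's constant, J s
M_E = 9.1093837015e-31    # electron mass, kg
EV = 1.602176634e-19      # one electron volt, J


def de_broglie_wavelength(mass, speed):
    """lambda = h / (m |v|), relation (9b)."""
    return H / (mass * abs(speed))


def electron_wavelength_angstrom(energy_ev):
    """Wave length of a nonrelativistic electron of given kinetic energy."""
    p = math.sqrt(2 * M_E * energy_ev * EV)
    return H / p * 1e10


def velocities(p, mass):
    """Phase and group velocity from (7) for E = p^2 / 2m.

    The group velocity dE/dp is taken by a central difference, to mimic
    what one would do for a dispersion law known only numerically.
    """
    def energy(q):
        return q * q / (2 * mass)

    dp = 1e-6 * abs(p)
    w_phase = energy(p) / p
    w_group = (energy(p + dp) - energy(p - dp)) / (2 * dp)
    return w_phase, w_group
```

For a free particle with `p = mass * v`, `velocities` returns approximately `v / 2` and `v`, in agreement with (8).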

Exercises 

1. Show that the de Broglie wave length of an electron having the
kinetic energy of $P$ electron volts is approximately $12.2\,P^{-1/2}$ angstroms.

2. Set P = 0 in 2®®, interpret the two terms in the resulting as an 
incident and a reflected wave and the resulting as a transmitted wave, 
compare the intensities of the incident and the reflected waves, and show 
that, when 0 < P < Vo , the reflection coefficient of the rectangular barrier 
of Fig. 59®® for the incident wave is 8®®; then compare the intensities of the 
incident and the transmitted waves, and derive 9®®.. 





3. Use the procedure of Exercise 2 to derive 16®® and 17®®; note that in 
deriving 17®® independently of 16®® it is necessary to take into account the 
fact that the potentials on the two sides of a; = 0 in Fig. 62®® are different. 

4. Show that 26®® vanishes whenever $l = \tfrac{1}{2}n\lambda$, where $n = 1, 2, \ldots$, and
where $\lambda$ is the de Broglie wave length of the particle in the central region
of Fig. 59®® when $E > V_0$. What phenomenon in the transmission of a
wave incident normally on a refracting plate does this effect resemble?

5. Evaluate the de Broglie wave length and the de Broglie frequency of a
free particle when $V(x) = V_0 \neq 0$.

6. Draw a rough graph of the Kennard wave packet 212® for the case 
po = 0 and contrast it with the wave packet of Fig. 

42. A Vibrating String and a Particle in a Box 

The mathematical similarity between the methods of the Schroe- 
dinger theory of a particle and of the classical theory of wave 
motion makes it possible to correlate problems in the quantum 
theory of particle motion with certain problems in classical wave 
motion in such a way that acquaintance with a problem in one 
theory is helpful in dealing with a particular problem in the other 
theory. Thus there is a certain similarity between the methods of 
handling the classical problem of the small vibrations of an elastic 
string of infinite length and the Schroedinger method of studying a 
particle moving freely in one dimension; the Schroedinger treat- 
ment of the motion of a particle in a one-dimensional box is 
analogous in several mathematical details to the classical problem 
of the small vibrations of a finite string fixed at both ends; the 
Schroedinger technique for treating the linear harmonic oscillator 
resembles that used in the classical theory of the diffraction of 
waves by a parabolic cylinder — these are but a few of the examples 
that can be quoted. But in pointing out such similarities we must 
emphasize the fact that the dissimilarities which may accompany 
them are often so important that it is in general quite unsafe, 
without a careful scrutiny, to carry the conclusions drawn in the 
one problem over to the other. For example, the analogy of an 
infinite string would suggest that the propagation of the de Broglie 
waves of a free particle should be nondispersive; we know, however,
that this is not so, because the values of $w_\phi$ and $w_g$ given by
$8^{41}$ are not equal to each other.

To illustrate these matters we shall now discuss briefly a classi- 
cal string and a quantum-mechanical particle in a box. 

The small transverse vibrations of an elastic string fixed at
both ends. We consider a string fastened at $x = 0$ and at $x = l$,
whose equilibrium position coincides with the $x$-axis, whose linear
density is a constant $\rho$, and whose tension is a constant $\tau$; and
we assume that the string is perfectly elastic and can support only
tangential stress, that friction can be neglected, that the angle
between the $x$-axis and the tangent to the string at any point is
small,* that the displacement $u = u(x, t)$ of every point of the
string from its position of equilibrium is at every instant transverse
to the $x$-axis, and that the displaced string lies in a plane.

When the string is displaced (Fig. 72), its length is given by
the standard calculus formula as $\int_0^l [1 + (\partial u/\partial x)^2]^{1/2}\,dx$;
since $\partial u/\partial x$ is in our case small, we may take this length to be
$\int_0^l [1 + \tfrac{1}{2}(\partial u/\partial x)^2]\,dx$, so that
the string is longer than $l$ by the amount $\tfrac{1}{2}\int_0^l (\partial u/\partial x)^2\,dx$.
Taking the potential energy of the undisturbed string to be zero, we
find that the potential energy of the displaced string (elongation
times tension) at the instant $t$ is

$$\text{potential energy} = \tfrac{1}{2}\tau\int_0^l\left(\frac{\partial u}{\partial x}\right)^2 dx. \tag{1}$$

Fig. 72. A string displaced from the equilibrium position.


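The kinetic energy of a portion $dx$ of the string at the instant $t$ is
$\tfrac{1}{2}(\rho\,dx)v^2 = \tfrac{1}{2}\rho\,dx\,(\partial u/\partial t)^2$, and hence the kinetic energy of the whole
string at the instant $t$ is

$$\text{kinetic energy} = \tfrac{1}{2}\rho\int_0^l\left(\frac{\partial u}{\partial t}\right)^2 dx. \tag{2}$$

The total energy of the string is consequently

$$W = \tfrac{1}{2}\int_0^l\left[\tau\left(\frac{\partial u}{\partial x}\right)^2 + \rho\left(\frac{\partial u}{\partial t}\right)^2\right]dx. \tag{3}$$

To derive the differential equation satisfied by $u$ we set up the
equation

$$\text{force} = \text{mass} \times \text{acceleration} \tag{4}$$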


* For details, see a book on theoretical physics.



for the short portion $AB$ of the string in Fig. 73. The
external forces $T_1$ and $T_2$ are both of magnitude $\tau$, so that
the resultant upward force is $\tau(-\sin\alpha_1 + \sin\alpha_2)$; that
is, if we replace the sines of the small angles by their tangents,
it is

$$\tau(-\tan\alpha_1 + \tan\alpha_2); \tag{5}$$

the resultant horizontal force is $\tau(-\cos\alpha_1 + \cos\alpha_2)$ and, to the
first approximation, is zero. Equation (4) thus becomes

$$\tau(-\tan\alpha_1 + \tan\alpha_2) = \rho\,\Delta x\,\frac{\partial^2 u}{\partial t^2}. \tag{6}$$

Now, the slope of $u$ at $A$ is $\tan\alpha_1 = \partial u(x, t)/\partial x$, while the slope of
$u$ at $B$ is $\tan\alpha_2 = \partial u(x + \Delta x, t)/\partial x$, so that, to the first order in
$\Delta x$, $\tan\alpha_2 = \partial u(x, t)/\partial x + \Delta x\,\partial^2 u(x, t)/\partial x^2$ and
$-\tan\alpha_1 + \tan\alpha_2 = \Delta x\,\partial^2 u/\partial x^2$. Substituting this result into (6), canceling
$\Delta x$, and dividing through by $\tau$, we finally get

$$\frac{\partial^2 u}{\partial x^2} = \frac{\rho}{\tau}\frac{\partial^2 u}{\partial t^2}. \tag{7}$$

In addition to satisfying (7), the displacement of a string fixed at
$x = 0$ and at $x = l$ satisfies the boundary conditions

$$u(0, t) = 0, \qquad u(l, t) = 0. \tag{8}$$

The standard method of computing the general solution of (7)
satisfying the conditions (8) runs as follows. To begin with, we
look for solutions of (7) having the rather special form

$$u(x, t) = X(x)T(t), \tag{9}$$

where $X(x) = X$ is a function only of $x$, and $T(t) = T$ is a function
only of $t$. To determine the conditions that must be satisfied by
$X$ and $T$ in order that (9) may be a solution of (7), we substitute (9)
into (7) and, writing $X''$ for $d^2X(x)/dx^2$ and $T''$ for $d^2T(t)/dt^2$, get
$X''T = \rho XT''/\tau$, that is,

$$\frac{X''}{X} = \frac{\rho}{\tau}\frac{T''}{T}. \tag{10}$$

Fig. 73. Force diagram for a portion of a string.



Since $X$ is a function only of $x$, the quantity $X''/X$ depends only
on $x$ and is independent of $t$; similarly, the quantity $\rho T''/\tau T$
depends only on $t$ and is independent of $x$. Now, Equation (10)
asserts that the quantity $X''/X$ (which, as we have seen, can
depend only on $x$) is equal to a quantity that is independent of $x$;
hence $X''/X$ does not depend even on $x$, and we may write
$X''/X = -K$, that is,

$$X'' + KX = 0, \tag{11}$$

where $K$ is some constant regarding which we have as yet no
information. It now follows from (10) and (11) that $T''/T = -\tau K/\rho$,
that is,

$$T'' + \frac{\tau}{\rho}KT = 0. \tag{12}$$

Thus the problem of finding the solutions of (7) having the
special form (9) reduces to that of finding the respective solutions
of the two equations (11) and (12), each of which involves only one
independent variable. The process of converting a differential
equation containing several independent variables into several
differential equations each of which contains fewer independent
variables than the original equation is called separation of
variables.

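The respective general solutions of Equations (11) and (12) are

$$X = A\sin K^{1/2}x + B\cos K^{1/2}x \tag{13}$$

and

$$T = C\sin\left(\frac{\tau K}{\rho}\right)^{1/2}t + D\cos\left(\frac{\tau K}{\rho}\right)^{1/2}t, \tag{14}$$

so that (9) becomes

$$u = \left[A\sin K^{1/2}x + B\cos K^{1/2}x\right]\left[C\sin\left(\frac{\tau K}{\rho}\right)^{1/2}t + D\cos\left(\frac{\tau K}{\rho}\right)^{1/2}t\right]. \tag{15}$$

The boundary conditions (8) require that (15) vanish for $x = 0$
and for $x = l$, and consequently we must have

$$B = 0, \qquad K^{1/2} = n\pi/l, \qquad n = 1, 2, 3, \ldots, \tag{16}$$

the case $n = 0$ being trivial. Introducing the quantity

$$\nu_n = \frac{n}{2l}\left(\frac{\tau}{\rho}\right)^{1/2} \tag{17}$$

and writing $u_n$ for $u$, we now reduce (15) with the help of (16) to

$$u_n = A_n\sin\frac{n\pi x}{l}\cos 2\pi(\nu_n t + \delta_n), \tag{18}$$

where $A_n$ and $\delta_n$ are arbitrary constants. The reader will perhaps
recognize (17) as giving the characteristic frequencies of our string,
and (18) as describing its characteristic vibrations. Since the
values of $u$ outside the limits 0 and $l$ are of no physical interest,
we may take them to be zero and may write the expression for the
$n$th characteristic vibration as

$$u_n = \begin{cases} 0 & x < 0 \\ \operatorname{Re} A_n\sin\dfrac{n\pi x}{l}\,e^{-2\pi i(\nu_n t + \delta_n)} & 0 < x < l \\ 0 & l < x. \end{cases} \tag{19}$$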

Now, the problem that we undertook to solve is to find the
general solution of (7) subject to the conditions (8), and at first
sight the identification of the solutions of the rather special form
(9) may appear to be a digression from our main object. In fact,
however, any solution of (7) which is of physical interest and
satisfies (8) can be expressed as a superposition of the characteristic
vibrations, that is, as

$$u = \sum_n A_n\sin\frac{n\pi x}{l}\cos 2\pi(\nu_n t + \delta_n), \qquad n = 1, 2, 3, \ldots. \tag{20}$$

The function (20), which contains the infinitely many arbitrary
constants $A_n$ and $\delta_n$ and which no longer has the restricted form
(9), thus comprises the complete solution of (7) subject to the
conditions (8); the large number of arbitrary constants is due to the
fact that (7) is a partial rather than an ordinary differential
equation.

Using (3), we find that the energy of a string executing the $n$th
characteristic vibration (18) is

$$W_n = \frac{\pi^2\tau}{4l}\,n^2A_n^2, \tag{21}$$

and that the energy for the general vibration (20) is

$$W = \sum_n W_n = \frac{\pi^2\tau}{4l}\sum_n n^2A_n^2, \qquad n = 1, 2, 3, \ldots. \tag{22}$$

Thus the energy of a characteristic vibration is, in particular,
proportional to the square of the amplitude of the vibration,
and the energy of a more complicated motion is the sum of the
energies of the characteristic component vibrations into which the
motion as a whole can be resolved.
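
The energy formulas can be checked by evaluating the integral (3) numerically for a superposition of characteristic vibrations and comparing with (22). The sketch below assumes zero phases $\delta_n$ and hypothetical values of the string parameters; the function names are illustrative only.

```python
import math

# Hypothetical string parameters (not from the text).
L_STR, RHO, TAU = 1.0, 0.01, 4.0   # length l, linear density rho, tension tau


def nu_n(n):
    """Characteristic frequency (17)."""
    return n / (2 * L_STR) * math.sqrt(TAU / RHO)


def energy_numeric(amps, t, steps=2000):
    """Midpoint-rule evaluation of (3) for u = sum_n A_n sin(n pi x/l) cos(2 pi nu_n t).

    amps[0] is A_1, amps[1] is A_2, and so on; all phases are taken as zero.
    """
    dx = L_STR / steps
    total = 0.0
    for j in range(steps):
        x = (j + 0.5) * dx
        u_x = 0.0
        u_t = 0.0
        for n, a in enumerate(amps, start=1):
            kn = n * math.pi / L_STR
            wn = 2 * math.pi * nu_n(n)
            u_x += a * kn * math.cos(kn * x) * math.cos(wn * t)
            u_t += -a * wn * math.sin(kn * x) * math.sin(wn * t)
        total += 0.5 * (TAU * u_x ** 2 + RHO * u_t ** 2) * dx
    return total


def energy_closed(amps):
    """Closed form (22): (pi^2 tau / 4l) * sum of n^2 A_n^2."""
    return math.pi ** 2 * TAU / (4 * L_STR) * sum(
        n * n * a * a for n, a in enumerate(amps, start=1))
```

Evaluating `energy_numeric` at two different instants gives the same value, which illustrates that the energy of the motion is constant and that the cross terms between different harmonics drop out.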

A vibrating string and a particle in a box. Let us now turn to a
quantum-mechanical particle in the potential box of Fig. 36.
The second Schroedinger equation then has, in the region
$0 < x < l$, the form

$$-\frac{\hbar^2}{2m}\frac{\partial^2\psi}{\partial x^2} = i\hbar\frac{\partial\psi}{\partial t}; \tag{23}$$

the eigenvalues of $H$ are

$$E_n = \frac{n^2h^2}{8ml^2}; \tag{24}$$

the (not normalized) eigenfunction of $H$ belonging to $E_n$ can be
written as

$$\psi_n = \begin{cases} 0 & x < 0 \\ A_n\sin\dfrac{n\pi x}{l}\,e^{-2\pi i(\nu_n t + \delta_n)} & 0 < x < l \\ 0 & l < x, \end{cases} \tag{25}$$

where $A_n$ and $\delta_n$ are arbitrary real constants, and

$$\nu_n = E_n/h = n^2h/8ml^2; \tag{26}$$

and the $\psi$-function specifying an arbitrary state of the system can
be expressed as a superposition of the $\psi$-functions (25).

We note first of all the close resemblance of the $\psi_n$ given by
(25) to the $u_n$ given by (19), the resemblance responsible for the
drawing of analogies between the particle in the box and the
vibrating string. But we should also note a number of dissimilarities,
some of which are listed below.

The Schroedinger equation (23) involves the first time derivative
of $\psi$, while the string equation (7) involves the second time
derivative of $u$. This fact is responsible for many differences
between the respective properties of $\psi$ and of $u$, among which the
following is perhaps the most fundamental: the knowledge of $\psi$
as a function of $x$ for any single instant $t_0$ is sufficient to determine
$\psi$ for all time; on the other hand, the knowledge of $u$ as a function
of $x$ for a single instant $t_0$ is not sufficient to determine $u$ for all
time; the behavior of the string depends not only on its displacement
at some initial instant $t_0$, but also on the velocities of
all of its points at that instant.†

The function $u_n$ is always and everywhere real, and may be
pictured at any instant as a line in the $ux$-plane. On the other
hand, owing to the imaginary coefficient in (23), the function $\psi_n$
is real only for special values of $t$, so that it can be pictured as a
line in a fixed plane only at isolated instants and cannot be
visualized as readily as $u_n$.

In dealing with $u_n$ we may restrict ourselves entirely to the
range $0 < x < l$; the values of $u_n$ outside this range do not enter
into the computations concerning the physical properties of the
string. But the values of $\psi_n$ outside this range are important;
for example, the discontinuities of $\partial\psi_n/\partial x$ at the ends of the box
(discontinuities that could be disregarded were we permitted to
work entirely within the region $0 < x < l$) affect the distribution-in-$p$
of the particle, as was shown in Exercise 4^^.

The value of $A_n$ in $u_n$ is obviously of physical interest; the
value of $A_n$ in $\psi_n$, on the other hand, does not affect any of the
physical conclusions‡ to be drawn from the assertion that the
state of the system is specified by the $\psi$-function (25). For
example, if the displacement of the string is specified by (19),
its energy is certainly (21), while if the particle is in the state
(25) its energy is certainly (24); and we note that $A_n$ plays an
important part in (21) but does not appear in (24) at all.

The frequency of $u_n$, given by (17), is proportional to $n$; while
the frequency of $\psi_n$, given by (26), is proportional to $n^2$.

A very important dissimilarity in physical interpretation arises
in connection with superposition. For example, if the displacement
of the string is

$$u = u_1 + u_2, \tag{27}$$


† If $\theta$ is the temperature and $\sigma$ is the diffusivity (that is, the ratio of the
thermal conductivity to the specific thermal capacity) of a homogeneous
body, then the one-dimensional equation for the conduction of heat is
$\partial^2\theta/\partial x^2 = (1/\sigma)\,\partial\theta/\partial t$. This equation resembles (23) much more closely than does
(7); but to make the resemblance complete the diffusivity has to be taken
as a pure imaginary, and for this reason the analogy between (23) and the
heat-flow equation is drawn rather seldom in elementary work.

‡ It is for this reason that we may once for all assign to $A_n$ in (25) a
value which normalizes $\psi_n$.



where $u_1$ and $u_2$ are given by (19), then the energy of the string is a
certainty, being given by (22) as

$$W = W_1 + W_2 = \frac{\pi^2\tau}{4l}\,(A_1^2 + 4A_2^2), \tag{28}$$

where $W_1$ and $W_2$ are the energies associated, respectively, with the
first and second harmonics in (27). On the other hand, if the
state of the particle in the box is specified by the Schroedinger
function

$$\psi = \psi_1 + \psi_2, \tag{29}$$

where $\psi_1$ and $\psi_2$ are given by (25), then the energy of the system is
not a certainty; in fact, according to the theorem on p. 111, the
possible results of a precise energy measurement are then $E_1 = h^2/8ml^2$
and $E_2 = 4h^2/8ml^2$, and their respective probabilities are

$$\frac{|A_1|^2}{|A_1|^2 + |A_2|^2} \quad\text{and}\quad \frac{|A_2|^2}{|A_1|^2 + |A_2|^2}. \tag{30}$$

Note that the simplicity of the result (30) is due to the fact that
the normalizing factor of (25) is independent of n.
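
The contrast between (28) and (30) can be made concrete with a few lines of code. This sketch generalizes (30) in the obvious way to any finite superposition of the states (25); the function names and the dictionary layout are hypothetical, introduced only for illustration.

```python
H = 6.62607015e-34   # Planck's constant, J s


def box_energy(n, mass, length):
    """Energy eigenvalue (24): E_n = n^2 h^2 / (8 m l^2)."""
    return n * n * H * H / (8 * mass * length ** 2)


def energy_distribution(amplitudes, mass, length):
    """Possible energies and their probabilities for psi = sum_n psi_n.

    amplitudes maps n to the (possibly complex) coefficient A_n of psi_n.
    The probabilities are |A_n|^2 divided by the sum of all |A_n|^2, which
    relies on the normalizing factor of (25) being independent of n.
    """
    norm = sum(abs(a) ** 2 for a in amplitudes.values())
    return [(box_energy(n, mass, length), abs(a) ** 2 / norm)
            for n, a in sorted(amplitudes.items())]
```

For example, `energy_distribution({1: 1.0, 2: 2j}, mass, length)` assigns the probabilities 1/5 and 4/5 to $E_1$ and $E_2$, whereas a string with the same two amplitudes has the single definite energy (28).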


Exercises 


1. Consider the second Schroedinger equation $H\psi = i\hbar\,\partial\psi/\partial t$ for one
dimension, restrict yourself to its solutions of the special form $\psi = X(x)T(t)$,
and show that this equation then separates into the two ordinary
differential equations

$$HX = EX \quad\text{and}\quad i\hbar\frac{dT}{dt} = ET, \tag{31a, b}$$

where $E$ is the constant which is introduced in the process of separating
the variables and which is arbitrary until the conditions of good behavior
are imposed. Note that (31a) is just the one-dimensional first Schroedinger
equation. Solve (31b), and correlate these computations with
those on page 86.

2. Discuss the effect of the choice of the origin from which potential
energy is measured (a) on the frequency of $u_n$ for a string, (b) on the
frequency of $\psi_n$ for a boxed particle, and (c) on the distributions-in-$x$
for stationary and for nonstationary states of a boxed particle.



Chapter VII

CONSTANTS OF MOTION 


43. Time Derivatives of Averages of Dynamical Variables 


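If the state of a one-dimensional dynamical system is specified
by a normalized Schroedinger function $\psi$ then, according to the
expectation formula, the average value of a dynamical variable
$\alpha$ for this state is

$$\mathrm{av}_\psi\,\alpha = \int\bar\psi\alpha\psi\,dx, \tag{1}$$

where $\alpha$ in the integrand is the Schroedinger operator associated
with the dynamical variable $\alpha$. Now, in view of the second
Schroedinger equation both $\psi$ and $\bar\psi$ are time-dependent; in
fact, in the one-dimensional case, they satisfy the equations

$$\frac{\partial\psi}{\partial t} = \frac{i\hbar}{2m}\frac{\partial^2\psi}{\partial x^2} - \frac{i}{\hbar}V\psi, \qquad \frac{\partial\bar\psi}{\partial t} = -\frac{i\hbar}{2m}\frac{\partial^2\bar\psi}{\partial x^2} + \frac{i}{\hbar}V\bar\psi. \tag{2a, b}$$

Consequently $\mathrm{av}_\psi\,\alpha$ is usually a function of $t$ even when $\alpha$ does
not involve $t$ explicitly.

The time derivative of $\mathrm{av}_\psi\,\alpha$ is, according to (1),

$$\frac{d}{dt}\,\mathrm{av}_\psi\,\alpha = \frac{d}{dt}\int\bar\psi\alpha\psi\,dx. \tag{3}$$

If $\alpha$ does not involve $t$ explicitly, we get, differentiating in the
right side of (3) under the integral sign,

$$\frac{d}{dt}\,\mathrm{av}_\psi\,\alpha = \int\frac{\partial\bar\psi}{\partial t}\,\alpha\psi\,dx + \int\bar\psi\alpha\frac{\partial\psi}{\partial t}\,dx. \tag{4}$$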

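Averages of dynamical variables and time derivatives of averages
of dynamical variables satisfy, in quantum mechanics, relations
similar to those satisfied by dynamical variables and time derivatives
of dynamical variables in the classical theory. For example,
according to (4) and (2),

$$\frac{d}{dt}\,\mathrm{av}_\psi\,x = \frac{i\hbar}{2m}\int\left(\bar\psi x\frac{\partial^2\psi}{\partial x^2} - \frac{\partial^2\bar\psi}{\partial x^2}\,x\psi\right)dx; \tag{5}$$

integration by parts reduces the right side of (5) to
$-(i\hbar/m)\int\bar\psi\,(\partial\psi/\partial x)\,dx$, and hence

$$\frac{d}{dt}\,\mathrm{av}_\psi\,x = \frac{1}{m}\int\bar\psi\left(-i\hbar\frac{\partial}{\partial x}\right)\psi\,dx. \tag{6}$$

Comparing the right side of (6) with that of (1), and recalling
that $-i\hbar\,\partial/\partial x$ is the Schroedinger operator associated with the
momentum $p$, we finally rewrite (6) as

$$\frac{d}{dt}\,\mathrm{av}_\psi\,x = \frac{1}{m}\,\mathrm{av}_\psi\,p. \tag{7}$$

Thus, for an arbitrary normalizable state of an arbitrary dynamical
system, the quantities $d\,\mathrm{av}_\psi\,x/dt$ and $\mathrm{av}_\psi\,p$ satisfy an equation
similar to the classical $\dot x = p/m$.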

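Similarly,

$$\frac{d}{dt}\,\mathrm{av}_\psi\,p = -i\hbar\int\left(\frac{\partial\bar\psi}{\partial t}\frac{\partial\psi}{\partial x} + \bar\psi\frac{\partial^2\psi}{\partial x\,\partial t}\right)dx; \tag{8}$$

eliminating $\partial\bar\psi/\partial t$ by means of (2b) and $\partial^2\psi/\partial x\,\partial t$ by means of the
equation obtained by differentiating (2a) with respect to $x$, and
integrating by parts, we reduce the right side of (8) to $-\int\bar\psi\,(\partial V/\partial x)\,\psi\,dx$,
and get

$$\frac{d}{dt}\,\mathrm{av}_\psi\,p = \mathrm{av}_\psi\left(-\frac{\partial V}{\partial x}\right), \tag{9}$$

that is, an equation resembling the classical $\dot p = -\partial V/\partial x$.

Again, substitution of $\alpha = H = -\kappa\,\partial^2/\partial x^2 + V$ into (4), the
use of (2), and repeated integration by parts, yield $d\left(\int\bar\psi H\psi\,dx\right)/dt = 0$,
that is,

$$\frac{d}{dt}\,\mathrm{av}_\psi\,H = 0, \tag{10}$$

another result to be expected by classical analogy. Incidentally,
we have already shown in §28 that in the case of a free particle
(when, classically, $p$ is a constant of motion) we have $d\,\mathrm{av}_\psi\,p/dt = 0$,
as is to be expected by classical analogy.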
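
Equations (7) and (9) lend themselves to a direct numerical check: choose a normalizable $\psi$, obtain $\partial\psi/\partial t$ from (2a) by finite differences, and evaluate both sides using (1) and (4). The following is a minimal sketch under stated assumptions: units with $\hbar = m = 1$, a Gaussian packet, and a hypothetical potential $V = \tfrac{1}{2}Kx^2$; none of these choices comes from the text.

```python
import cmath

HBAR, MASS = 1.0, 1.0            # convenient units (an assumption)
K_SPRING = 2.0                   # hypothetical potential V = K x^2 / 2
X0, P0, SIGMA = 1.0, 1.5, 1.0    # packet centre, mean momentum, width parameter
DX, N = 0.01, 2001
XS = [-10.0 + j * DX for j in range(N)]


def ddx(f):
    """Central-difference derivative; psi is negligible at the grid ends."""
    out = [0j] * len(f)
    for j in range(1, len(f) - 1):
        out[j] = (f[j + 1] - f[j - 1]) / (2 * DX)
    return out


def integral(f):
    return sum(f) * DX


psi = [cmath.exp(-(x - X0) ** 2 / (4 * SIGMA ** 2) + 1j * P0 * x / HBAR)
       for x in XS]
norm = integral([abs(p) ** 2 for p in psi])   # psi is left unnormalized


def psi_t(f):
    """Right side of (2a): the time derivative fixed by the Schroedinger equation."""
    f_xx = ddx(ddx(f))
    return [1j * HBAR / (2 * MASS) * fxx - 1j / HBAR * 0.5 * K_SPRING * x * x * fj
            for x, fj, fxx in zip(XS, f, f_xx)]


def expectation(apply_op, f):
    """Formula (1), divided by the norm since f is not normalized."""
    of = apply_op(f)
    return integral([a.conjugate() * b for a, b in zip(f, of)]) / norm


def ddt_expectation(apply_op, f):
    """Formula (4): integral of conj(f_t)(op f) + conj(f)(op f_t), over the norm."""
    ft = psi_t(f)
    of, oft = apply_op(f), apply_op(ft)
    return integral([a.conjugate() * b + c.conjugate() * d
                     for a, b, c, d in zip(ft, of, f, oft)]) / norm


def op_x(f):
    return [x * fj for x, fj in zip(XS, f)]


def op_p(f):
    return [-1j * HBAR * d for d in ddx(f)]


def op_dV(f):
    return [K_SPRING * x * fj for x, fj in zip(XS, f)]


lhs7 = ddt_expectation(op_x, psi).real       # d/dt av x
rhs7 = expectation(op_p, psi).real / MASS    # (1/m) av p
lhs9 = ddt_expectation(op_p, psi).real       # d/dt av p
rhs9 = -expectation(op_dV, psi).real         # av(-dV/dx)
```

Within the accuracy of the finite differences, `lhs7` agrees with `rhs7` and `lhs9` with `rhs9`; for this packet the two pairs are close to `P0 / MASS` and `-K_SPRING * X0` respectively.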

In deriving Equations (7), (9), and (10), we did not restrict
ourselves to any particular dynamical system, and in this sense
these equations are general; but we did use normalized $\psi$-functions.
The question now arises, do these equations hold for states specified
by nonnormalizable $\psi$-functions? In reply we shall assert,
waiving mathematical caution, that the relations between averages
of dynamical variables and time derivatives of averages of dynamical
variables which hold for all normalizable states of arbitrary systems
hold also (if only formally) for nonnormalizable states; accordingly,
we assume that (7), (9), (10), and other similar equations
hold even for states specified by nonnormalizable $\psi$-functions.
Narrow x-packets. The equation

$$m\frac{d^2}{dt^2}\,\mathrm{av}_\psi\,x = -\int\bar\psi\frac{\partial V}{\partial x}\psi\,dx, \tag{11}$$

which follows from (7) and (9), enables us to indicate an asymptotic
connection between quantum theory and classical mechanics.
Let a particle be almost certainly at $x = x_a$ at a certain instant,
so that, at this instant, the probability packet $\bar\psi\psi$ is sufficiently
narrow to justify writing $x_a\int\bar\psi\psi\,dx = x_a$ for $\int\bar\psi x\psi\,dx$, and
$V'(x_a)\int\bar\psi\psi\,dx = V'(x_a)$ for $\int\bar\psi V'(x)\psi\,dx$. Equation (11) then
becomes

$$m\frac{d^2x_a}{dt^2} = -V'(x_a), \tag{12}$$

and comparison with the classical equation of motion shows that, at an
instant at which the x-packet is sufficiently narrow, the particle may be
said to move according to classical laws.¹

Equation (12) is a satisfactory approximation only as long as the
$x$-packet remains narrow. Now, in view of the Heisenberg
inequality, the statement that the $x$-packet is narrow implies
that little is known about the momentum of the particle; consequently,
to put the matter somewhat roughly, we may expect
that, if the $x$-packet is very narrow, then it will soon become
broad. To illustrate, we shall consider a Kennard packet and
restrict ourselves to times near $t = 0$. Let $\Delta_x(t)$ denote the
inexactitude of $x$ at the instant $t$; then, by Taylor's theorem,

$$\Delta_x^2(t) = \Delta_x^2(0) + \left[\frac{d\Delta_x^2}{dt}\right]_{t=0}t + \frac{1}{2}\left[\frac{d^2\Delta_x^2}{dt^2}\right]_{t=0}t^2 + \cdots \tag{13}$$

and, for a packet which at $t = 0$ has the Kennard form, we get
(Exercise 4)

$$\Delta_x^2(t) = \Delta_x^2(0) + \frac{1}{2}\left[\frac{\hbar^2}{2m^2\Delta_x^2(0)} + \frac{2}{m}\int\bar\psi\,(x_a - x)V'\psi\,dx\right]_{t=0}t^2 + \cdots. \tag{14}$$

¹ P. Ehrenfest, Zeits. f. Physik, 45, 455 (1927).

Thus $\Delta_x^2$ varies parabolically with $t$ for times near $t = 0$, and the
length of time during which (12) is a good approximation depends
on the initial value of $\Delta_x$, on the values of $V'$ in the vicinity of
$x_a$, on the mass of the particle, and on the value of Planck's
constant h.
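
To get a feeling for the time scale implied by (14), one may take the simplest case $V' = 0$ (a free particle), for which (14) reduces to $\Delta_x^2(t) = \Delta_x^2(0) + \hbar^2t^2/4m^2\Delta_x^2(0)$. The sketch below evaluates this expression; the function names are hypothetical, and the neglect of the higher terms of (13) is an assumption of the sketch.

```python
import math

HBAR = 1.054571817e-34    # Planck's constant divided by 2 pi, J s
M_E = 9.1093837015e-31    # electron mass, kg


def free_packet_width(t, width0, mass):
    """Inexactitude Delta_x(t) from (14) with V' = 0."""
    return math.sqrt(width0 ** 2 + (HBAR * t / (2 * mass * width0)) ** 2)


def doubling_time(width0, mass):
    """Time at which the width given by free_packet_width has doubled.

    Setting Delta_x(t) = 2 Delta_x(0) gives t = 2 sqrt(3) m Delta_x(0)^2 / hbar.
    """
    return 2 * math.sqrt(3) * mass * width0 ** 2 / HBAR
```

The doubling time grows with the mass and with the square of the initial width, so that a narrow packet for a light particle broadens quickly while a packet for a heavy body stays narrow for a long time.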

Exercises 

1. Verify that (7) holds for the state 19®^.

2. Show that, if $\psi$ is normalized, then $m\,d\,\mathrm{av}_\psi\,x^2/dt = \int\bar\psi\,(xp + px)\,\psi\,dx$,
and $m^2\,d^2\,\mathrm{av}_\psi\,x^2/dt^2 = 2\int\bar\psi\,(p^2 - mxV')\,\psi\,dx$.

3. Let $\Delta_x$, $\Delta_p$, $x_a$, and $p_a$ denote, respectively, the inexactitudes and
the averages of $x$ and $p$ at the instant $t$, and let $\psi$ be normalized; show that
then $m\,d\Delta_x^2/dt = \int\bar\psi\,(xp + px)\,\psi\,dx - 2x_ap_a$, and
$m^2\,d^2\Delta_x^2/dt^2 = 2\Delta_p^2 + 2m\int\bar\psi\,(x_a - x)V'\psi\,dx$.

4. Use the results of Exercise 3 to show that, if $\psi$ is given at $t = 0$ by
4®®, then (13) reduces to (14).

5. Show that (14) is consistent with the fact that for the state 19®® of
an oscillator the $x$-packet moves without spreading.

6. Discuss the following curious feature of the reflection coefficient 16®®
for a single-step barrier: it does not involve $h$ at all, and yet it is different
from the corresponding classical reflection coefficient, which, of course, is
zero.


44. Time Derivatives of Dynamical Variables 

From time derivatives of averages of dynamical variables,
we now turn to time derivatives of the dynamical variables
themselves. In classical mechanics, the time derivative of a
dynamical variable $\alpha$ is identified by a reference to states specified
by precise initial conditions; for such states, every dynamical
variable has an unambiguous numerical value at every instant of
time, so that the standard definition

$$\dot\alpha = \lim_{\tau\to 0}\frac{\alpha(t + \tau) - \alpha(t)}{\tau} \tag{1}$$

has an immediate physical significance. According to quantum
mechanics, however, the value of a dynamical variable cannot
usually be known precisely for every instant of time even for
states specified in all physically obtainable detail, and consequently
the right side of (1) does not usually have any clear-cut
meaning. It is, of course, very convenient to be able to refer in a
suggestive way to one dynamical variable as the time derivative
of another, and in order to make this possible in quantum mechanics
we introduce the following definition,² which makes no
reference to the classical formula (1): If the dynamical variables
$\alpha$ and $\beta$ pertaining to a given dynamical system are so related that
for every state of the system the average of $\beta$ at every instant equals the
time derivative of the average of $\alpha$ at the same instant, then $\beta$ is said
to be the time derivative of $\alpha$, and we write $\beta = \dot\alpha$. Or, expressed in
symbols, if for every state of the system

$$\frac{d}{dt}\,\mathrm{av}_\psi\,\alpha = \mathrm{av}_\psi\,\beta \tag{2}$$

at every instant of time, then

$$\dot\alpha = \beta. \tag{3}$$

To illustrate: Having shown that for every normalizable state
of an arbitrary system

$$(7^{43}) \qquad \frac{d}{dt}\,\mathrm{av}_\psi\,x = \frac{1}{m}\,\mathrm{av}_\psi\,p, \tag{4}$$

and having later asserted that this equation holds for an arbitrary
state, we write in quantum mechanics

$$\dot x = p/m. \tag{5}$$

Equation (5) is precisely the equation connecting $\dot x$ and $p/m$ in
classical mechanics; and, lest this identity in form suggest to the
reader too great a similarity in the respective physical contents
of the classical and the quantum-mechanical equations, we must
emphasize that in quantum mechanics Equation (5) is essentially
an abbreviation for (4), and means just this: for an arbitrary
state of an arbitrary system, the time derivative of the expected average
of $x$ is, at every instant, equal to the expected average of $p/m$ at that
instant. Similarly, the relations

$$\frac{d}{dt}\,\mathrm{av}_\psi\,p = \mathrm{av}_\psi\left(-\frac{\partial V}{\partial x}\right), \qquad \frac{d}{dt}\,\mathrm{av}_\psi\,H = 0, \tag{6}$$

obtained in §43, enable us to write

$$\dot p = -\frac{\partial V}{\partial x}, \qquad \dot H = 0 \tag{7}$$

in the case of conservative quantum-mechanical systems.

2 G. Breit, Rev. Mod. Phys., 4, 637 (1932). 



To identify the time derivative of a given dynamical variable $\alpha$,
we must, according to the definition above, rearrange the expression
$d\,\mathrm{av}_\psi\,\alpha/dt$ in such a way that it has the form of an average.
The process of rearrangement (illustrated, for example, by the
step from $5^{43}$ to $6^{43}$) usually involves integrations by parts, and is
tedious when $\alpha$ is complicated; but this process can be simplified
by the use of an important property of the Schroedinger operators
which we now proceed to discuss.

Hermitian operators. The expected average of a dynamical
variable is, by definition, the arithmetic mean of the possible
results of a precise measurement of this variable, each possible
result being weighted by its probability. The possible results of
measurements of actual physical variables (to which variables we
restrict our attention) are real numbers, and consequently the
average of any physical variable for any state at any time must
be a real number. Now, if $\alpha$ is the Schroedinger operator associated
with a dynamical variable $\alpha$, and if $\psi$ is a normalized
Schroedinger function specifying the state, then, according to the
expectation formula,

$$\mathrm{av}_\psi\,\alpha = \int\bar\psi\alpha\psi\,dx, \tag{8}$$

so that, if the average of $\alpha$ for the state $\psi$ is to be real, the expression
$\int\bar\psi\alpha\psi\,dx$ must be real; that is, in terms of the symbolism of
2®, we must have

$$\operatorname{Im}\int\bar\psi\alpha\psi\,dx = 0 \tag{9}$$

for an arbitrary normalizable $\psi$. All Schroedinger operators
associated with real variables must satisfy this condition.

The complex conjugate of $\int\bar\psi\alpha\psi\,dx$ may be written either as
$\overline{\int\bar\psi\alpha\psi\,dx}$ or as $\int\psi\,\overline{(\alpha\psi)}\,dx$, and hence we have the identity

$$\overline{\int\bar\psi\alpha\psi\,dx} = \int\overline{\alpha\psi}\cdot\psi\,dx, \tag{10}$$

where $\overline{\alpha\psi}$ denotes the complex conjugate of the function $\alpha\psi$, and
$\overline{\alpha\psi}\cdot\psi$ denotes $(\overline{\alpha\psi})\psi$. If $\alpha$ satisfies (9), the bar over the left side
of (10) can be removed, so that

$$\int\bar\psi\alpha\psi\,dx = \int\overline{\alpha\psi}\cdot\psi\,dx. \tag{11}$$

Now, let $u$ and $v$ be any two normalizable functions of $x$, and
let $\alpha$ satisfy (9), so that (11) holds also. Then

$$\int\overline{(u + v)}\,\alpha(u + v)\,dx = \int\overline{\alpha(u + v)}\cdot(u + v)\,dx, \tag{12}$$





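that is,

$$\int \bar{u}\,\alpha u\,dx + \int \bar{v}\,\alpha v\,dx + \int \bar{u}\,\alpha v\,dx + \int \bar{v}\,\alpha u\,dx
= \int \overline{\alpha u}\cdot u\,dx + \int \overline{\alpha v}\cdot v\,dx + \int \overline{\alpha u}\cdot v\,dx + \int \overline{\alpha v}\cdot u\,dx. \tag{13}$$

Because of (11) the first two terms on the left of (13) cancel the
first two terms on the right, so that

$$\int \bar{u}\,\alpha v\,dx - \int \overline{\alpha v}\cdot u\,dx = \int \overline{\alpha u}\cdot v\,dx - \int \bar{v}\,\alpha u\,dx, \tag{14}$$

a result which can be written equally well as

$$\int \bar{u}\,\alpha v\,dx - \overline{\int \bar{u}\,\alpha v\,dx} = \int \overline{\alpha u}\cdot v\,dx - \overline{\int \overline{\alpha u}\cdot v\,dx}, \tag{15}$$

that is, as

$$2i\,\mathrm{Im}\int \bar{u}\,\alpha v\,dx = 2i\,\mathrm{Im}\int \overline{\alpha u}\cdot v\,dx. \tag{16}$$

Using the function $iv$ instead of $v$ in (12), we get, instead of (15),

$$\int \bar{u}\,\alpha\,iv\,dx - \overline{\int \bar{u}\,\alpha\,iv\,dx} = \int \overline{\alpha u}\cdot iv\,dx - \overline{\int \overline{\alpha u}\cdot iv\,dx}; \tag{17}$$

cancellation of the factor $i$ in (17) yields
$\int \bar{u}\,\alpha v\,dx + \overline{\int \bar{u}\,\alpha v\,dx} = \int \overline{\alpha u}\cdot v\,dx + \overline{\int \overline{\alpha u}\cdot v\,dx}$, that is,

$$2\,\mathrm{Re}\int \bar{u}\,\alpha v\,dx = 2\,\mathrm{Re}\int \overline{\alpha u}\cdot v\,dx. \tag{18}$$

It now follows from (16) and (18) that

$$\int \bar{u}\,\alpha v\,dx = \int \overline{\alpha u}\cdot v\,dx. \tag{19}$$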


If an operator $\alpha$ satisfies (19) whenever $u$ and $v$ are normalizable,
we call it Hermitian, self-adjoint, or real (although, strictly
speaking, these terms are not synonymous). We have shown above
that (19) is deducible from (9); and, since (9) must hold for all
Schroedinger operators associated with real dynamical variables,
we conclude that every Schroedinger operator associated with a real
dynamical variable is Hermitian.

That any one of the simpler Schroedinger operators satisfies (19) can
be verified directly; for example, in the case of the
momentum operator $p = -i\hbar\,\partial/\partial x$, we have $\int \bar{u}\,pv\,dx = -i\hbar\int \bar{u}\,v'\,dx$;
integrating by parts, remembering that $u$ and $v$ are normalizable, and
rearranging the result, we get $-i\hbar\int \bar{u}\,v'\,dx = i\hbar\int \bar{u}'v\,dx = \int \overline{(-i\hbar u')}\,v\,dx = \int \overline{pu}\cdot v\,dx$, so that

$$\int \bar{u}\,pv\,dx = \int \overline{pu}\cdot v\,dx, \tag{20}$$

and $p$ satisfies (19). In view of the deducibility of (19) from (9) we may,
however, use (19) with any Schroedinger operator associated with a real
dynamical variable without verifying it for every particular case.
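The verification of (20) can also be illustrated numerically. The following is only a sketch under stated assumptions: the two test functions `u` and `v`, the choice of units with $\hbar = 1$, the finite-difference step, and the integration range are all hypothetical choices made for illustration and are not taken from the text.

```python
import cmath

HBAR = 1.0  # assumption: units in which hbar = 1


def u(x):
    # Hypothetical normalizable complex test function.
    return cmath.exp(-x * x + 1j * x)


def v(x):
    # A second hypothetical normalizable test function.
    return (x + 0.5) * cmath.exp(-0.5 * (x - 0.3) ** 2)


def p_op(f, h=1e-4):
    # Momentum operator p = -i*hbar*d/dx, approximated by a central difference.
    return lambda x: -1j * HBAR * (f(x + h) - f(x - h)) / (2.0 * h)


def integrate(g, a=-12.0, b=12.0, n=4000):
    # Trapezoidal rule on [a, b]; the integrands are negligible at the ends.
    step = (b - a) / n
    total = 0.5 * (g(a) + g(b))
    for k in range(1, n):
        total += g(a + k * step)
    return total * step


pu, pv = p_op(u), p_op(v)
lhs = integrate(lambda x: u(x).conjugate() * pv(x))   # integral of conj(u) * (p v)
rhs = integrate(lambda x: pu(x).conjugate() * v(x))   # integral of conj(p u) * v
print(abs(lhs - rhs))
```

The printed difference should be at the level of rounding error, which is what (20) leads one to expect for normalizable functions.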

So far the work of this section has been rigorous; but now we
shall assert that if an operator $\alpha$ is Hermitian, that is, if (19)
holds for every pair of normalizable functions $u$ and $v$, then it
holds, at least formally, for every pair of well-behaved functions
$u$ and $v$. This assertion, although not always justifiable from the
strict mathematical viewpoint, will expedite our calculations and
will not lead us astray in the elementary work.

Every eigenvalue of a Hermitian operator is real. To prove this,
we let $u$ be an eigenfunction of a Hermitian operator $\alpha$ belonging
to the eigenvalue $\lambda$, and set $v = u$ in (19); the result is $\int \bar{u}\,\lambda u\,dx = \int \overline{\lambda u}\cdot u\,dx$,
that is, $\lambda\int \bar{u}u\,dx = \bar{\lambda}\int \bar{u}u\,dx$, and hence $\lambda = \bar{\lambda}$.

If $\beta$ and $\gamma$ are commuting Hermitian operators, then the operator
$\beta\gamma$ is also Hermitian. To prove this, we write $\int \bar{u}\,\beta\gamma v\,dx = \int \bar{u}\,\beta(\gamma v)\,dx$
and note that, if $\beta$ is Hermitian, then $\int \bar{u}\,\beta(\gamma v)\,dx = \int \overline{\beta u}\cdot\gamma v\,dx$,
and if $\gamma$ is Hermitian, then $\int \overline{\beta u}\cdot\gamma v\,dx = \int \overline{\gamma\beta u}\cdot v\,dx$,
so that, if $\beta$ and $\gamma$ are both Hermitian, then $\int \bar{u}\,\beta\gamma v\,dx = \int \overline{\gamma\beta u}\cdot v\,dx$;
when $\beta$ and $\gamma$ commute, this is equivalent to $\int \bar{u}\,\beta\gamma v\,dx = \int \overline{\beta\gamma u}\cdot v\,dx$,
and comparison with (19) shows that $\beta\gamma$ is Hermitian.
In this computation we assumed that not only $u$ and $v$, but also
$\beta u$ and $\gamma v$ are well-behaved.

Equations of motion. As we have shown, the Schroedinger
operators associated with real dynamical variables are all Hermitian,
so that, in particular, the Schroedinger operator $H$ associated
with the Hamiltonian of a dynamical system is Hermitian.
The latter fact can be used in the following way to identify the
time derivative of a dynamical variable $\alpha$ which does not involve $t$
explicitly. If the state is specified by a normalized $\psi$-function,
then

$$\frac{d}{dt}\,\mathrm{av}_\psi\,\alpha = \int \frac{\partial\bar{\psi}}{\partial t}\,\alpha\psi\,dx + \int \bar{\psi}\,\alpha\,\frac{\partial\psi}{\partial t}\,dx. \tag{21}$$

The second Schroedinger equation requires that $i\hbar\,\partial\psi/\partial t = H\psi$,
so that, since $H$ does not involve $i$, we also have the equation
$-i\hbar\,\partial\bar{\psi}/\partial t = H\bar{\psi}$. Equation (21) can therefore be written as

$$\frac{d}{dt}\,\mathrm{av}_\psi\,\alpha = \frac{i}{\hbar}\left[\int H\bar{\psi}\cdot\alpha\psi\,dx - \int \bar{\psi}\,\alpha H\psi\,dx\right]. \tag{22}$$

Remembering that $H$ is Hermitian and writing $\alpha\psi$ for $v$, $\psi$ for $u$,
and $H$ for $\alpha$ in (19), we get

$$\int \bar{\psi}\,H\alpha\psi\,dx = \int H\bar{\psi}\cdot\alpha\psi\,dx, \tag{23}$$

and (22) becomes

$$\frac{d}{dt}\,\mathrm{av}_\psi\,\alpha = \frac{i}{\hbar}\left[\int \bar{\psi}\,H\alpha\psi\,dx - \int \bar{\psi}\,\alpha H\psi\,dx\right], \tag{24}$$

that is,

$$\frac{d}{dt}\,\mathrm{av}_\psi\,\alpha = \int \bar{\psi}\,\frac{i}{\hbar}(H\alpha - \alpha H)\,\psi\,dx. \tag{25}$$

Now, Equation (25) can be written as

$$\frac{d}{dt}\,\mathrm{av}_\psi\,\alpha = \mathrm{av}_\psi\,\beta, \tag{26}$$

where $\beta$ is the dynamical variable associated with the operator
$\frac{i}{\hbar}(H\alpha - \alpha H)$, $H$ and $\alpha$ being the operators associated, respectively,
with the Hamiltonian and with the dynamical variable $\alpha$.
Equation (26) has the form (2); therefore, assuming that (26)
holds for all states rather than only for all states specified by
normalizable $\psi$-functions, and recalling our definition of the time
derivative of a dynamical variable, we conclude: $\dot{\alpha}$ is the dynamical
variable which is associated with the operator $\frac{i}{\hbar}(H\alpha - \alpha H)$.

When the dynamical variable $\alpha$ does not involve $t$ explicitly, we
may therefore write, using $\dot{\alpha}$ to denote the operator associated
with the dynamical variable $\dot{\alpha}$,

$$\dot{\alpha} = \frac{i}{\hbar}(H\alpha - \alpha H). \tag{27}$$

Equation (27) is called the equation of motion of $\alpha$; in quantum
mechanics, the equation of motion of a dynamical variable is the
equation telling what operator is to be associated with the time
derivative of this dynamical variable.



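To illustrate the use of (27), let us identify once more the dynamical
variable $\dot{x}$. The operator associated with $\dot{x}$ is, according to (27),

$$\dot{x} = \frac{i}{\hbar}(Hx - xH), \tag{28}$$

that is,

$$\dot{x} = \frac{i}{\hbar}\left[\left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + V(x)\right)x - x\left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + V(x)\right)\right]
= -\frac{i\hbar}{2m}\left(\frac{\partial^2}{\partial x^2}\,x - x\,\frac{\partial^2}{\partial x^2}\right)
= -\frac{i\hbar}{2m}\cdot 2\,\frac{\partial}{\partial x}, \tag{29}$$

so that

$$\dot{x} = p/m, \tag{30}$$

where $p$ is the operator associated with the momentum. Passing from
operators to the corresponding dynamical variables, we get (5), that is,
the result obtained before by a study of $d\,\mathrm{av}_\psi\,x/dt$.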
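The step in (29) that replaces $\frac{\partial^2}{\partial x^2}x - x\frac{\partial^2}{\partial x^2}$ by $2\,\partial/\partial x$ can be checked on a sample operand. The sketch below is an illustration only: it represents a polynomial by its list of coefficients (a hypothetical representation chosen here, with the helper names `deriv`, `times_x`, `sub`, `scale` invented for the purpose) and applies both sides of the identity to one arbitrarily chosen polynomial.

```python
def deriv(c):
    # d/dx of the polynomial sum(c[k] * x**k); returns [0] for a constant.
    return [k * c[k] for k in range(1, len(c))] or [0]


def times_x(c):
    # Multiplication by x shifts every coefficient up by one power.
    return [0] + list(c)


def sub(a, b):
    # Coefficient-wise difference a - b, padding the shorter list with zeros.
    n = max(len(a), len(b))
    a = list(a) + [0] * (n - len(a))
    b = list(b) + [0] * (n - len(b))
    return [s - t for s, t in zip(a, b)]


def scale(c, k):
    return [k * t for t in c]


# Hypothetical operand: f(x) = 5 - x + 4x^2 + 2x^4.
f = [5, -1, 4, 0, 2]

# (d^2/dx^2 x - x d^2/dx^2) f, computed operator by operator.
lhs = sub(deriv(deriv(times_x(f))), times_x(deriv(deriv(f))))

# 2 df/dx.
rhs = scale(deriv(f), 2)

print(lhs, rhs)
```

The terms containing $V(x)$ in (29) drop out because $V(x)$ and $x$ are both multiplication operators and therefore commute, so only the identity checked here is needed.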

In conclusion, we shall indicate the reason for the detailed
resemblance between the classical and the quantum-mechanical
equations connecting dynamical variables and time derivatives of
dynamical variables. In the classical theory, whenever $\alpha$ does not
depend explicitly on $t$, $\dot{\alpha}$ is the negative of the Poisson bracket of
$H$ and $\alpha$:

(16*^) $$\dot{\alpha} = -[H, \alpha]. \tag{31}$$

Now, the quantum condition requires that the operator to be
associated with the Poisson bracket of two variables be $-i/\hbar$
times the commutator of the operators associated with the variables
themselves, so that, if $\alpha$ and $H$ are the operators associated
with the variables $\alpha$ and $H$, then the operator associated with the
dynamical variable $-[H, \alpha] = \dot{\alpha}$ should be

$$\frac{i}{\hbar}(H\alpha - \alpha H), \tag{32}$$

that is, just the operator (27).

Exercises

1. Show that the operator $x$ is Hermitian but that $\partial/\partial x$ is not. Under
what condition is an operator of type $c$ Hermitian?

2. Show that, if $\alpha$ and $\beta$ are Hermitian, then $\alpha + \beta$ and $\alpha\beta + \beta\alpha$ are also
Hermitian.

3. Show that, if $\alpha$ and $\beta$ are Hermitian, but do not commute, then their
commutator is not Hermitian, but $i(\alpha\beta - \beta\alpha)$ is Hermitian.

4. Show that the condition that the operator $\alpha$ be Hermitian is sufficient
in order that $\mathrm{av}_\psi\,\alpha$ be real for any (normalizable) state $\psi$.

5. Use the equation of motion (27) to identify the dynamical variable
$dx^2/dt$; in comparing the result with the corresponding classical result,
remember 14^^.

6. Let $g(x)$ be an operator, a function of the operator $x$ only and
admitting of a power series expansion in the operator $x$. Generalize the
result of Exercise and show that $(\partial/\partial x)g(x) - g(x)(\partial/\partial x) = g'(x)$,
where $g'(x)$ is the operator, a function of the operator $x$ only, obtained by a
formal differentiation of $g(x)$ with respect to $x$.

7. Use (27) and the result of Exercise 6 to derive the quantum-mechanical
analogue of the classical equation $\dot{p} = -\partial V(x)/\partial x$.

8. Use (27) to show that $d(\alpha + \beta)/dt = \dot{\alpha} + \dot{\beta}$ and $d\alpha\beta/dt = \dot{\alpha}\beta + \alpha\dot{\beta}$,
and explain what these equations mean.

9. Review Exercise 2** in the light of the results of the present section.

10. Show that the quantum condition is consistent with the requirement
that the operators associated with the dynamical variables $\alpha$, $\beta$,
and $[\alpha, \beta]$ have only real eigenvalues.

11. Using the explicit form of the one-dimensional Schroedinger operator
$H$, verify (23) for the case of normalizable $\psi$'s by integrating by parts.
Equation (25) can thus be derived without direct reference to the general
result (19).

45. Constants of Motion 

If the time derivative of the dynamical variable $\alpha$ pertaining to
a given dynamical system is zero (that is, quantum mechanically,
if $d\,\mathrm{av}_\psi\,\alpha/dt$ is zero for every state of the system), then $\alpha$ is called a
constant of motion of the system.

The reader will verify that the right side of $25^{44}$ vanishes for
every possible $\psi$ if and only if the operators $\alpha$ and $H$ commute,
and that consequently the simple criterion by which we may
recognize a quantum-mechanical constant of motion is the following:
if the dynamical variable $\alpha$ does not depend explicitly on $t$
and if

$$\alpha H = H\alpha \tag{1}$$

(that is, if the operator $\alpha$ commutes with the Hamiltonian operator
of the given system), then the dynamical variable $\alpha$ is a constant
of motion of this system. This result, which can be inferred
directly from the equation of motion

$$\dot{\alpha} = \frac{i}{\hbar}(H\alpha - \alpha H), \tag{2}$$

is usually abbreviated thus: if $\alpha$ commutes with the Hamiltonian
of the system and does not depend explicitly on $t$, then $\alpha$ is a constant
of motion of this system.

Now, if $\alpha$ commutes with $H$, then $\alpha^n$ also commutes with $H$, and
therefore, if $\alpha$ is a constant of motion for the given system, then
$\alpha^n$ is also a constant of motion for this system. Hence, if $\alpha$ is a
constant of motion, then the time derivative of the average of each
of its powers is zero for every state of the system; and, recalling
that the distribution-in-$\alpha$ is determined by the values of the
averages of the powers of $\alpha$, we get the result: if $\alpha$ is a constant of
motion, then the distribution-in-$\alpha$ for every state of the system is
stationary.*

If $c$ is a numerical constant, then

$$\dot{c} = \frac{i}{\hbar}(Hc - cH) = 0, \tag{3}$$

so that any numerical constant such as 1, 2, $m$, $i$, and so on, is a
(trivial) constant of motion of every dynamical system. When
enumerating the constants of motion of a system, we always omit
the mention of numerical constants.

If the given system is conservative, then $H$ is independent of $t$
and we have

$$\dot{H} = \frac{i}{\hbar}(HH - HH) = 0, \tag{4}$$

so that the energy of a conservative system is a constant of motion
of this system, and, in particular, the distribution-in-$H$ for any
state of a conservative system is stationary. Similarly, when the
system is conservative, then any function of $H$ not involving $t$
is a constant of motion.

In the case of a free particle, we have $H = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2}$, so that
the momentum operator $p = -i\hbar\,\partial/\partial x$ commutes with $H$. Hence
the momentum is a constant of motion of the free particle, and
for any state of a free particle the distribution-in-$p$ is stationary.
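The contrast between a constant of motion and a variable that does not commute with $H$ can be sketched numerically. The example below is an illustration under assumptions that are not part of the text: a particle confined to a box of unit length, units in which $\hbar = m = 1$, and a state that is an equal superposition of the two lowest energy states. It evaluates the average of $H$ and the average of $x$ at two different times.

```python
import cmath
import math

L = 1.0  # assumption: box of unit length, with hbar = m = 1


def energy(n):
    # Energy levels of the box in these units.
    return (n * math.pi / L) ** 2 / 2.0


def phi(n, x):
    # Normalized energy eigenfunctions of the box.
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)


def psi(x, t):
    # Equal superposition of the two lowest energy states, with time factors.
    return sum(phi(n, x) * cmath.exp(-1j * energy(n) * t) for n in (1, 2)) / math.sqrt(2.0)


def h_psi(x, t):
    # H acting on psi: each term is multiplied by its own energy.
    return sum(energy(n) * phi(n, x) * cmath.exp(-1j * energy(n) * t) for n in (1, 2)) / math.sqrt(2.0)


def integrate(g, n=2000):
    # Trapezoidal rule on [0, L].
    step = L / n
    total = 0.5 * (g(0.0) + g(L))
    for k in range(1, n):
        total += g(k * step)
    return total * step


def av_h(t):
    return integrate(lambda x: psi(x, t).conjugate() * h_psi(x, t)).real


def av_x(t):
    return integrate(lambda x: psi(x, t).conjugate() * x * psi(x, t)).real


t_half = math.pi / (energy(2) - energy(1))  # half a beat period
print(av_h(0.0), av_h(t_half))  # equal: H commutes with H
print(av_x(0.0), av_x(t_half))  # different: x does not commute with H
```

In this sketch the average of $H$ is the same at both instants, while the average of $x$ moves from one side of the box to the other, in line with the criterion (1).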

It follows from the concluding remarks of §44 that every 


* We emphasize the every, because in the exceptional case of an
energy state (when in the expectation formula 2^"^ the time factor of $\bar{\psi}$
cancels that of $\psi$) the distribution-in-$\alpha$ is stationary even if $\alpha$ is not a
constant of motion, provided $\alpha$ does not involve $t$ explicitly.





constant of motion of a classical dynamical system is also a
constant of motion of a corresponding quantum-mechanical
system. For example, the momentum of a free particle, being a
constant of motion in the classical case, is also a constant of motion
quantum mechanically. [A more elaborate example is provided
by the dynamical variables $L$, $L_x$, $L_y$, and $L_z$ (the magnitude and
the three components of the orbital angular momentum of the
particle) when the particle moves in a three-dimensional radial
field: $L$, $L_x$, $L_y$, and $L_z$, being constants of motion of a classical
particle in a radial force field, are, as we shall presently see, also
constants of motion of a quantum-mechanical particle in a radial
force field.]

If the state of a classical system is specified by a maximal set of
conditions (that is, by an equivalent of the precise initial conditions
of motion), then each constant of motion has a definite numerical
value that is independent of $t$. The situation is different in
quantum mechanics, where maximal sets of conditions are, so to
speak, less stringent; thus the value of a constant of motion may be
uncertain even though the state is specified by as many conditions
as it is possible to impose without contradictions; and in some
cases the precise knowledge of the value of one constant of motion
may even imply that the value of another constant of motion is
certainly uncertain. [To illustrate: when a particle moves in a
radial force field so that $L_x$, $L_y$, and $L_z$ (the components of orbital
angular momentum) are constants of motion, then it turns out
that for every state for which $L_x$ has a definite numerical value
other than zero the values of both $L_y$ and $L_z$ are bound to be
uncertain; but the distribution-in-$L_x$, the distribution-in-$L_y$, and
the distribution-in-$L_z$ are all stationary for every state; similarly,
for a state for which $L_y$ has a definite value other than zero, the
values of $L_z$ and $L_x$ are bound to be uncertain; and so forth.] The
fact that a precise knowledge of the value of a constant of motion $\alpha$
may be incompatible with the precise knowledge of the value of
another constant of motion $\beta$ shows up mathematically through
the noncommutability of the operators $\alpha$ and $\beta$.

Exercise 

1. Correlate and the results of Exercises and with the
results of this section.





46. Dirac’s Expansion Theorem 

If the state of a system is described by a $\psi$-function that is an
eigenfunction of an operator $\alpha$, we call this state an eigenstate of
$\alpha$; for example, as we already mentioned in §18, an energy state
of a system, being described by an eigenfunction of the operator $H$
pertaining to this system, is called an eigenstate of $H$. Accordingly,
the assertion that a state is an eigenstate of $\alpha$ means that
the value of the dynamical variable $\alpha$ for this state is a certainty.
If the $\psi$-function describing a state is a simultaneous eigenfunction
of the operators $\alpha$, $\beta$, and so on, we call this state a simultaneous
eigenstate of $\alpha$, $\beta$, and so on.

If the respective operators associated with certain dynamical 
variables commute with each other, then we say, for brevity, that 
these dynamical variables commute with each other; in particular, 
if the respective operators associated with certain constants of 
motion of a given system commute with each other, we say that 
these constants of motion form a set of commuting constants of 
motion of this system. 

Our interest in commuting constants of motion is due principally
to a theorem that we shall indicate somewhat imprecisely
as follows: if

$$\alpha_1, \alpha_2, \alpha_3, \cdots, \alpha_k \tag{1}$$

is a set of commuting constants of motion of a dynamical system,
then every state of this system is either a simultaneous eigenstate
of the $k$ quantities (1), or a superposition of such simultaneous
eigenstates.* In other words, every state of a system can be expanded
in terms of the particular states that are simultaneous eigenstates of
all the constants of motion. This theorem can be derived from the
following fundamental expansion theorem enunciated by Dirac:
an arbitrary $\psi$ can be expressed linearly in terms of the eigenfunctions
of a Hermitian operator. No entirely adequate general
proof of the latter theorem has as yet been constructed, however.
In the sequel we shall use the term expansion theorem to denote the
theorem stated above in italics. This theorem was found to hold
in the case of every particular system that has been investigated,
and the reliance on its correctness has been a most valuable guide
in the development of quantum mechanics, for example, in the
theory of atomic and molecular spectra. The theorem is perhaps
approached best through the branch of mathematics called group
theory; but we shall adopt it without further question.

* Strictly speaking, every particular simultaneous eigenstate of the
quantities (1) is itself a superposition of the various simultaneous
eigenstates of these quantities, a superposition in which all states except this
particular one enter with zero weight; our statement, which we adopt
because of its suggestiveness, is therefore tautologic.

To illustrate: In the case of a one-dimensional free particle, a set of
commuting constants of motion is

$$H = -\frac{1}{\kappa}\frac{\partial^2}{\partial x^2}, \qquad p = -i\hbar\,\frac{\partial}{\partial x}, \tag{2}$$

so that the expansion theorem claims that every state of a one-dimensional
free particle is either a simultaneous eigenstate of $H$ and $p$, or a
superposition of such simultaneous eigenstates. A reference to §28 proves this
claim to be correct: the simultaneous eigenstates of $H$ and $p$ are $6^{28}$ and
$7^{28}$; the general stationary state, $5^{28}$, is a superposition of $6^{28}$ and $7^{28}$; and
the most general state, $14^{28}$, is again a superposition of the simultaneous
eigenstates of $H$ and $p$.

The computational utility of the expansion theorem in the study
of a particular dynamical system lies in the facts that constants
of motion, other than $H$ and functions of $H$, are usually easier to
handle than $H$ itself, and that the theorem permits us essentially
to restrict our attention to the simultaneous eigenfunctions of the
operators (1) and to arrange the work as follows: we compute
first the eigenfunctions of the simplest constant of motion; then,
using the arbitrary features of the $\psi$'s so obtained, we adjust these
$\psi$'s to be also eigenfunctions of the next-simplest constant of
motion; and we proceed in this manner from the simpler to the
more complicated constants of motion until we identify the
simultaneous eigenfunctions of them all; and when this is done we
presumably know that every state of the system is either one of the
states already computed or a superposition of these states.

The expansion theorem is, so to speak, the more useful the more
complicated the system (provided, of course, that the system has a constant
of motion simpler than $H$), and we are not prepared at the moment to
illustrate its use in a way that does full justice to its power; the following
simple example may nevertheless be of interest.

Let it be desired to compute the energy levels of a free particle; the
equation $H\psi = E\psi$ is then

$$-\frac{1}{\kappa}\frac{\partial^2\psi}{\partial x^2} = E\psi, \tag{3}$$

so that the computation consists essentially in finding the well-behaved
solutions of a second-order differential equation. But the expansion
theorem permits us to avoid actually solving (3) for $\psi$, as follows: Our
system has the commuting constants of motion (2), of which $p$ is the
simpler; in fact, the eigenvalue equation $p\psi = p'\psi$ is

$$-i\hbar\,\frac{\partial\psi}{\partial x} = p'\psi, \tag{4}$$

that is, a first-order differential equation whose general solution

(4«) $$\psi = Ae^{ip'x/\hbar} \tag{5}$$

is well-behaved if $p'$ is real. The eigenfunctions of $p$ being available, we
proceed to find next the simultaneous eigenfunctions of $p$ and $H$ by adjusting
the number $p'$ in (5) so that (5) would satisfy (3). Substitution of (5)
into (3) yields the result $p' = \pm\hbar\sqrt{\kappa E}$, and hence the simultaneous
eigenfunctions of $H$ and $p$ are

$$\psi = Ae^{\pm i\sqrt{\kappa E}\,x}, \tag{6}$$

that is, apart from the time factors, just the functions $6^{28}$ and $7^{28}$. The
expansion theorem now claims that any state of a free particle is a
superposition of these simultaneous eigenfunctions (which must, of course, be
multiplied by the appropriate time factors), so that our work is completed.

To summarize: in the case of the free particle, the expansion theorem
enables us to solve the first-order equation (4) and then to adjust the
solution to satisfy (3), rather than to solve outright the second-order
equation (3). Both (3) and (4) are of course so simple that the use of the
theorem in this particular problem is hardly of assistance.
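The two-step procedure just summarized can be checked numerically. The sketch below is only an illustration under stated assumptions: units with $\hbar = m = 1$, the relation $\kappa = 2m/\hbar^2$ (assumed here by comparing the two ways of writing the free-particle $H$), a hypothetical energy value $E = 3$, and central differences in place of exact derivatives. It verifies that the eigenfunction of $p$ with $p' = \hbar\sqrt{\kappa E}$ also satisfies the second-order equation.

```python
import cmath
import math

HBAR = 1.0                     # assumption: hbar = 1
MASS = 1.0                     # assumption: m = 1
KAPPA = 2.0 * MASS / HBAR**2   # assumption: kappa = 2m / hbar^2
E = 3.0                        # hypothetical energy value

# Adjust p' so that the eigenfunction of p also satisfies H psi = E psi.
p_prime = HBAR * math.sqrt(KAPPA * E)


def psi(x):
    # Eigenfunction of p with eigenvalue p' (taking A = 1).
    return cmath.exp(1j * p_prime * x / HBAR)


def d1(f, x, h=1e-3):
    # Central-difference first derivative.
    return (f(x + h) - f(x - h)) / (2.0 * h)


def d2(f, x, h=1e-3):
    # Central-difference second derivative.
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)


xs = [-1.0, 0.0, 0.4, 2.5]

# Residual of the first-order equation  -i hbar psi' = p' psi.
res_p = max(abs(-1j * HBAR * d1(psi, x) - p_prime * psi(x)) for x in xs)

# Residual of the second-order equation  -(1/kappa) psi'' = E psi.
res_h = max(abs(-d2(psi, x) / KAPPA - E * psi(x)) for x in xs)

print(res_p, res_h)
```

Both residuals should be small, limited only by the finite-difference error, which is consistent with the claim that adjusting $p'$ is all that remains once the first-order equation has been solved.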

47. Dynamical Variables Having no Classical Analogues 

Each specific operator that we have used so far was introduced 
because it had been identified as the operator associated in the 
Schroedinger scheme with a dynamical variable that we desired to 
study; and each was, of course, a linear operator capable of 
operating on functions of x. Now, are there linear operators 
that are capable of operating on functions of x but that are not 
associated in the Schroedinger scheme with classical dynamical 
variables? If such operators exist, are they useful in quantum-mechanical
computations? The answers to both questions are
in the affirmative; and in this section we shall give two examples of
operators of this kind, their use to be illustrated in the next.

Let us consider first the mathematical operation of substituting
$-x$ for $x$ in $u(x)$ and introduce the operator, to be denoted by $\Omega$,
which, when operating on $u(x)$, converts it into $u(-x)$, so that

$$\Omega u(x) = u(-x) \tag{1}$$

for every $u(x)$; for example, $\Omega e^x = e^{-x}$, $\Omega \sin x = \sin(-x) = -\sin x$,
$\Omega \cos x = \cos(-x) = \cos x$, $\Omega(x^2 + 3x) = x^2 - 3x$, and
so forth. Equation (1) enables us to compute the result of operating
with $\Omega$ on any given operand and hence defines $\Omega$ completely,
even though we may not be able to express $\Omega$ in terms of operators
that are more familiar to us. $\Omega$ is a linear operator, since for any
$u(x)$ and $v(x)$ we have $\Omega(u + v) = \Omega u + \Omega v$ and $\Omega cu = c\Omega u$.

Let us next compute the eigenvalues of $\Omega$, that is, the numbers $\lambda$
for which the equation

(15^) $$\Omega u(x) = \lambda u(x), \tag{2}$$

that is,

$$u(-x) = \lambda u(x), \tag{3}$$

has well-behaved solutions. Operating with $\Omega$ on both sides of
(3), we get $\Omega u(-x) = \Omega\lambda u(x)$, that is,

$$u(x) = \lambda u(-x), \tag{4}$$

and elimination of $u$ between (3) and (4) yields $\lambda^2 = 1$, that is,

$$\lambda = \pm 1, \tag{5}$$

so that $\Omega$ can have no eigenvalues other than 1 and $-1$. Setting
$\lambda = 1$ in (2), we get $\Omega u(x) = u(x)$, that is, $u(-x) = u(x)$; this
equation is satisfied by every even function of $x$ and, in particular,
by every well-behaved even function of $x$; consequently, $\Omega$ has
the eigenvalue $+1$, and every well-behaved even function of $x$ is an
eigenfunction of $\Omega$ belonging to the eigenvalue $+1$. Setting
$\lambda = -1$ in (2), we find in a similar way that $\Omega$ has the eigenvalue
$-1$, and that every well-behaved odd function of $x$ is an
eigenfunction of $\Omega$ belonging to the eigenvalue $-1$. Each of the two
eigenvalues of $\Omega$ is degenerate to an infinite degree.
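The defining property (1) and the eigenvalue statements above lend themselves to a short numerical sketch. In the code below, the name `parity`, the helper `max_diff`, and the sample points are hypothetical choices made for illustration; the operator is represented simply as a function that maps one Python function to another.

```python
import math


def parity(u):
    # The operator Omega of equation (1): it converts u(x) into u(-x).
    return lambda x: u(-x)


xs = [-2.0, -0.3, 0.0, 0.9, 1.7]  # hypothetical sample points


def max_diff(f, g):
    # Largest pointwise difference between f and g on the sample points.
    return max(abs(f(x) - g(x)) for x in xs)


def poly(x):
    return x**2 + 3 * x


# cos is even: eigenfunction of Omega with eigenvalue +1.
even_check = max_diff(parity(math.cos), math.cos)

# sin is odd: eigenfunction of Omega with eigenvalue -1.
odd_check = max_diff(parity(math.sin), lambda x: -math.sin(x))

# Omega (x^2 + 3x) = x^2 - 3x, as in the examples following (1).
poly_check = max_diff(parity(poly), lambda x: x**2 - 3 * x)

# Applying Omega twice restores the operand, which is why lambda^2 = 1.
square_check = max_diff(parity(parity(math.exp)), math.exp)

print(even_check, odd_check, poly_check, square_check)
```

All four printed differences should vanish to rounding accuracy.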

If an operator is explicitly expressible in a reasonably simple
way in terms of $-i\hbar\,\partial/\partial x$, $x$, and $c$, we have as a rule no difficulty
in identifying the classical dynamical variable with which this
operator is associated; but our operator $\Omega$ turns out not to bear
any simple relation to the elementary Schroedinger operators,
and we find it impossible to interpret it in any clear-cut way as
being associated with a classical dynamical variable. As we shall
see, the operator $\Omega$ can, nevertheless, be employed to advantage in
certain mathematical quantum-mechanical calculations, in which
it is treated on the same footing as the operators that are associated
with classical dynamical variables. To make the terminology
as uniform as possible, we then speak of the 'dynamical
variable $\Omega$' with the understanding that this dynamical variable
has no classical analogue; thus if $\psi$ is a normalized Schroedinger
function, and if the integral

$$\int \bar{\psi}\,\Omega\psi\,dx, \tag{6}$$

that is, $\int \bar{\psi}(x)\psi(-x)\,dx$, should come into play in a mathematical
calculation, we would refer to (6) as the average of the dynamical
variable $\Omega$, or simply the average of $\Omega$, for the state $\psi$ and
would denote (6) by $\mathrm{av}_\psi\,\Omega$, even though the 'dynamical variable
$\Omega$' has no classical analogue. Similarly, if a state is an eigenstate
of $\Omega$ belonging to the eigenvalue 1 of $\Omega$, that is, if it is specified by a
$\psi$-function which is even in $x$, then we find it convenient to say that
the value of $\Omega$ for this state is certainly 1, even though we are
unable to describe an apparatus which is capable of measuring
the 'dynamical variable $\Omega$.'

In general, if a linear operator $\alpha$ finds application in the mathematical
machinery of quantum mechanics, we speak of the
dynamical variable $\alpha$ associated with it; and if, by using backwards
the procedure for associating operators with dynamical variables,
we find it impossible to identify a classical dynamical variable
corresponding to the operator $\alpha$, we say that the dynamical
variable $\alpha$ has no classical analogue. This manner of speaking is,
of course, merely a matter of terminology.

We consider next the linear operator, to be denoted by $\Gamma_l$ or
simply by $\Gamma$, which, when operating on $u(x)$, converts it into
$u(x + l)$, where $l$ is a prescribed real constant, so that

$$\Gamma u(x) = u(x + l) \tag{7}$$

for every $u(x)$; for example, $\Gamma e^x = e^{x+l} = e^l e^x$, $\Gamma \sin x = \sin(x + l)$,
$\Gamma(x^2 + 3x) = (x + l)^2 + 3(x + l)$, and so forth. The operator $\Gamma$
is of interest in connection with certain dynamical systems (§49),
and hence it is sometimes convenient to speak of it as associated
with a dynamical variable $\Gamma$, a variable to which no classical
analogue can usually be ascribed.

To determine the eigenfunctions and eigenvalues of $\Gamma$, we seek
the well-behaved solutions of the equation

(15®) $$\Gamma u = \lambda u, \tag{8}$$

that is,

$$u(x + l) = \lambda u(x), \tag{9}$$

where $\lambda$ is a constant. To begin with, we suppress for a moment
the conditions of good behavior and note that, if $v$ is a periodic
function of $x$ with period $l$, that is, if

$$v(x + l) = v(x), \tag{10}$$

then operating with $\Gamma$ on $v$ is equivalent to multiplying $v$ by unity;
that, if $a$ is a constant, then operating with $\Gamma$ on the function $e^{ax}$ is
equivalent to multiplying $e^{ax}$ by the constant $e^{al}$; and that the
function

$$u = Ae^{ax}v(x), \qquad [v(x + l) = v(x)] \tag{11}$$

(where $A$ and $a$ are constants and $v$, as indicated, is periodic in $x$
with period $l$) is the most general function which, when operated on
by $\Gamma$, becomes multiplied by a constant. Returning to the
boundary conditions, we now note that since $v$ is periodic the
function (11) is certain to be ill-behaved in the infinite region
unless $a$ is a pure imaginary, and that consequently we must
restrict ourselves to functions of the form

$$u = e^{ikx}v(x), \qquad [v(x + l) = v(x)], \tag{12}$$

where $k$ is a real constant and where $A$ has been incorporated in $v$.
The factor $e^{ikx}$ and all its derivatives being continuous, the function
(12) will satisfy our continuity conditions if $v$ does so, and
hence the eigenfunctions of the operator $\Gamma$ are functions of the
form (12), where $v$ is any well-behaved function periodic in $x$
with period $l$.

Operating with $\Gamma$ on (12), we get

$$\Gamma e^{ikx}v(x) = e^{ik(x+l)}v(x + l) = e^{ikl}e^{ikx}v(x). \tag{13}$$

The eigenvalue of $\Gamma$ belonging to the eigenfunction (12) is consequently
$e^{ikl}$; this eigenvalue is in general complex, and hence the
dynamical variable $\Gamma$ is to be regarded as complex rather than real.
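Equation (13) can be illustrated with a concrete choice of the periodic factor. In the sketch below, the values of $l$ and $k$, the particular periodic function `v`, and the sample points are hypothetical choices made for illustration only.

```python
import cmath
import math

l = 1.5   # hypothetical value of the displacement (and period) l
k = 0.8   # hypothetical real constant k


def gamma(u):
    # The operator Gamma of equation (7): it converts u(x) into u(x + l).
    return lambda x: u(x + l)


def v(x):
    # A well-behaved function periodic in x with period l.
    return 2.0 + math.cos(2.0 * math.pi * x / l)


def u(x):
    # A function of the form (12).
    return cmath.exp(1j * k * x) * v(x)


eigenvalue = cmath.exp(1j * k * l)  # the eigenvalue read off from (13)

xs = [-3.1, -0.4, 0.0, 0.75, 2.2]  # hypothetical sample points
residual = max(abs(gamma(u)(x) - eigenvalue * u(x)) for x in xs)

print(residual, eigenvalue)
```

The residual should vanish to rounding accuracy, and the printed eigenvalue has unit modulus but a nonzero imaginary part, which illustrates why $\Gamma$ is to be regarded as a complex rather than a real dynamical variable.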

Exercises 

1. Show that $(1 + \Omega)^2 = 2(1 + \Omega)$, $(1 - \Omega)^2 = 2(1 - \Omega)$, $(1 + \Omega)(1 - \Omega) = 0$,

rnr « n and « i. 

2. Show that the operator $\Omega$ is Hermitian and that the operator $\Gamma$ is
not Hermitian.

3. Show that for the state 10*^ of an oscillator $\mathrm{av}\,\Omega = \exp(-x_0^2/a^2)$.
Remember 23®*.

4. Show that for the state of an oscillator $\mathrm{av}\,\Omega^n$ equals $\exp(-x_0^2/a^2)$
if $n$ is odd and 1 if $n$ is even; then show that the 'distribution-in-$\Omega$' for
this state is:

Possible values of $\Omega$      1                                        $-1$
Their probabilities        $\frac{1}{2}[1 + \exp(-x_0^2/a^2)]$      $\frac{1}{2}[1 - \exp(-x_0^2/a^2)]$

Note that this distribution is stationary, even though the state is not
an energy state.


48. Constants of Motion Having no Classical Analogues 

A dynamical variable having no classical analogue may commute 
with the Hamiltonian operator, and consequently a quantum- 
mechanical system may have constants of motion in excess of 
those suggested by classical analogy. These extra constants of 
motion are just as useful in the study of the system as are those 
that do have classical analogues; in particular, the expansion 
theorem of §46 draws no distinction between constants of motion 
that do have classical analogues and those that do not. 

To illustrate: we consider a particle moving in one dimension in
a potential field which is even in $x$, so that

$$V(-x) = V(x); \tag{1}$$

unless $V(x)$ is a constant and the particle is free, classical theory
then suggests that only $H$ and functions of $H$ are constants of
motion. In fact, however, the dynamical variable $\Omega$ defined by
$1^{47}$ is also a constant of motion in this case; indeed, for an arbitrary
$u(x)$ on which the indicated operations can be performed, we have

$$\Omega H u(x) = \Omega\left[-\frac{1}{\kappa}\frac{\partial^2}{\partial x^2}u(x) + V(x)u(x)\right]$$
$$= -\frac{1}{\kappa}\frac{\partial^2}{\partial(-x)^2}u(-x) + V(-x)u(-x)$$
$$= -\frac{1}{\kappa}\frac{\partial^2}{\partial x^2}u(-x) + V(x)u(-x) \qquad [\text{see (1)}]$$
$$= Hu(-x) = H\Omega u(x); \tag{2}$$

hence

$$\Omega H = H\Omega, \tag{3}$$

and $\Omega$ satisfies the criterion $1^{45}$. Thus a system whose potential
is even in $x$ has at least the two following constants of motion:

$$H, \quad \Omega. \tag{4}$$
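The computation leading to (3) can be mirrored numerically. The following sketch rests on assumptions that are not part of the text: the value $\kappa = 2$, a central-difference approximation to the second derivative, a hypothetical operand `u`, and two sample potentials, one even in $x$ and one not. It compares $\Omega H u$ with $H \Omega u$ at a few points.

```python
import math

KAPPA = 2.0  # assumption: units in which kappa = 2


def make_h(V, h=1e-3):
    # Returns the operator H = -(1/kappa) d^2/dx^2 + V(x), with the second
    # derivative approximated by a central difference of step h.
    def H(u):
        def hu(x):
            second = (u(x + h) - 2.0 * u(x) + u(x - h)) / (h * h)
            return -second / KAPPA + V(x) * u(x)
        return hu
    return H


def parity(u):
    # The operator Omega: it converts u(x) into u(-x).
    return lambda x: u(-x)


def commutator_residual(V, u, xs):
    # Largest value of |Omega H u - H Omega u| on the sample points.
    H = make_h(V)
    return max(abs(parity(H(u))(x) - H(parity(u))(x)) for x in xs)


def u(x):
    # Hypothetical operand, neither even nor odd in x.
    return math.exp(-(x - 1.0) ** 2)


xs = [-1.5, -0.4, 0.7, 2.0]

even_residual = commutator_residual(lambda x: x**2, u, xs)      # V even in x
not_even_residual = commutator_residual(lambda x: x**3, u, xs)  # V not even in x

print(even_residual, not_even_residual)
```

For the even potential the residual should be at the level of rounding error, while for the potential that is not even it is clearly different from zero, so that $\Omega$ fails to be a constant of motion in that case.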

Let us now see to what extent the constant of motion $\Omega$ is
helpful in computing the eigenfunctions of $H$ when $V$ is even in $x$.
When applied to the set (4), the expansion theorem asserts
that every eigenfunction of $H$ is either a simultaneous eigenfunction
of $H$ and $\Omega$ or a superposition of such simultaneous
eigenfunctions. Hence, when $V$ is even in $x$, the essential part
of the search for the eigenfunctions of $H$ is the identification of the
simultaneous eigenfunctions of $\Omega$ and $H$. Now, as shown in §47,
the eigenfunctions of $\Omega$ are the well-behaved even functions of $x$
and the well-behaved odd functions of $x$. Consequently the
expansion theorem asserts that in seeking the eigenfunctions of $H$
when $V$ is even in $x$ we may from the outset restrict ourselves to
operands that are even in $x$ and to operands that are odd in $x$,
and need not consider operands that are neither even nor odd in $x$
(with the assurance that, if $H$ has eigenfunctions that are neither
even nor odd in $x$, then these eigenfunctions are superpositions of
eigenfunctions of $H$ that are even in $x$ and eigenfunctions of $H$
that are odd in $x$, and may be obtained later from the simultaneous
eigenfunctions of $H$ and $\Omega$ by taking linear combinations of the
latter). Note that the claim "every eigenfunction of $H$ that is
neither even nor odd in $x$ is a superposition of an eigenfunction of $H$
that is even in $x$ and an eigenfunction of $H$ that is odd in $x$" is not
to be confused with the almost trivial* claim that "every function
that is neither even nor odd in $x$ is a superposition of a function
that is even in $x$ and a function that is odd in $x$."

The result obtained above with the help of the expansion 
theorem and of the dynamical variable Ω has been to a certain 
extent anticipated in our previous discussion. For example, we 
showed in theorem 29 of §32 that, if V is even in x and the motion 
is bound, then every eigenfunction of H is either even or odd in x; 
the possibility that both even and odd eigenfunctions of H could 
belong to the same eigenvalue of H did not arise there, since in the 
bound one-dimensional case the eigenvalues of H are nondegenerate. 
Again, in §39, we were able to devote our attention 
separately to the even solutions and to the odd solutions of the 
Schroedinger equation for the even potential of Fig. 66. 

* Let $u(x)$ be arbitrary; then $f(x) = \tfrac{1}{2}[u(x) + u(-x)]$ is even in x, 
$g(x) = \tfrac{1}{2}[u(x) - u(-x)]$ is odd in x, and $u(x)$ can be written as 
$u(x) = f(x) + g(x)$, that is, as a superposition of an even and an odd function. 

268 THE SCHROEDINGER METHOD §48 
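The "almost trivial" decomposition of the footnote is easily tried out numerically. The following is a minimal sketch in Python; the test function u and the sample points are arbitrary choices made here for illustration and do not come from the text.

```python
import math

def even_part(u):
    """Return f with f(x) = [u(x) + u(-x)] / 2, which is even in x."""
    return lambda x: 0.5 * (u(x) + u(-x))

def odd_part(u):
    """Return g with g(x) = [u(x) - u(-x)] / 2, which is odd in x."""
    return lambda x: 0.5 * (u(x) - u(-x))

def u(x):
    # Hypothetical test function, neither even nor odd.
    return math.exp(x) + x**3 - 2.0 * x**2

f, g = even_part(u), odd_part(u)
samples = [-1.5, -0.3, 0.0, 0.7, 2.0]

# Largest deviation of f + g from u over the sample points.
max_error = max(abs(f(x) + g(x) - u(x)) for x in samples)
```

Note that this check concerns arbitrary functions only; it says nothing about the stronger claim regarding eigenfunctions of H, which rests on the expansion theorem.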

Potential lattices. The usefulness of the constants of motion 
not having classical analogues is illustrated perhaps more forcibly 
when the potential is periodic in x with period l, say, so that 

$V(x) = V(x + l)$; (5)

unless V(x) is a constant and the particle is free, classical theory 
then suggests that only H and functions of H are constants of 
motion. However, using the operator Γ, which replaces x by x + l 
in its operand, we get for 
an arbitrary u on which the indicated operations can be performed 

$$\begin{aligned}
\Gamma H u(x) &= \Gamma\left[-\frac{1}{\kappa}\frac{d^2}{dx^2}u(x) + V(x)u(x)\right] \\
&= -\frac{1}{\kappa}\frac{d^2}{d(x+l)^2}u(x+l) + V(x+l)u(x+l) \\
&= -\frac{1}{\kappa}\frac{d^2}{dx^2}u(x+l) + V(x)u(x+l) \qquad [\text{see } (5)] \\
&= Hu(x+l) = H\Gamma u(x);
\end{aligned}$$ (6)

hence 

$\Gamma H = H\Gamma$, (7)

and Γ is a constant of motion. Thus a particle moving in a 
periodic potential or, as we shall often say, a particle moving in a 
potential lattice, has at least the two following constants of motion: 

H, Γ. (8) 
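The commutation rule ΓH = HΓ can also be checked on a grid. The following is a minimal sketch in Python and is not part of the original argument: the grid sizes, the sample potential, and the test function are arbitrary choices made here, and the second derivative is replaced by a central difference on a ring of points so that a shift by one period is well defined.

```python
import math

N_PER_PERIOD = 40            # grid points per lattice period (assumed)
N_PERIODS = 5                # number of periods on the ring (assumed)
PERIOD = 1.0                 # the period l, in arbitrary units
KAPPA = 1.0                  # the constant kappa, in arbitrary units
N = N_PER_PERIOD * N_PERIODS
STEP = PERIOD / N_PER_PERIOD

def potential(j):
    """A sample periodic V: it depends on j only through j mod N_PER_PERIOD."""
    x = (j % N_PER_PERIOD) * STEP
    return 3.0 * math.cos(2.0 * math.pi * x / PERIOD)

def apply_H(u):
    """Discrete version of -(1/kappa) u'' + V u on a ring of N points."""
    return [-(u[(j + 1) % N] - 2.0 * u[j] + u[(j - 1) % N]) / (KAPPA * STEP**2)
            + potential(j) * u[j]
            for j in range(N)]

def apply_Gamma(u):
    """Discrete version of Gamma u(x) = u(x + l): shift by one full period."""
    return [u[(j + N_PER_PERIOD) % N] for j in range(N)]

# An arbitrary operand, deliberately not periodic with period l.
u = [math.sin(0.37 * j) + math.cos(1.3 * j * j) for j in range(N)]

gamma_h = apply_Gamma(apply_H(u))
h_gamma = apply_H(apply_Gamma(u))
commutator_norm = max(abs(a - b) for a, b in zip(gamma_h, h_gamma))
```

The two orderings agree here because the sample potential has the period of the shift; replacing the shift by a fraction of a period would in general spoil the agreement.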

The expansion theorem now asserts that, when the particle 
moves in a potential lattice, then every eigenfunction of H is 
either a simultaneous eigenfunction of H and Γ, or a superposition 
of such simultaneous eigenfunctions. Hence, when V is periodic 
in x, the essential part of the search for the eigenfunctions of H 
is the identification of the simultaneous eigenfunctions of Γ and 
H. Now, the eigenfunctions of Γ are 

$u = e^{ikx}v$, $[v(x + l) = v(x)]$, (9)

where k is a real constant and v is a well-behaved periodic function 
of x of period l; consequently the expansion theorem asserts that in 
seeking the eigenfunctions of H for a potential lattice we may from 



269 


CONSTANTS OF MOTION 


§48 


the outset restrict ourselves to operands of the form (9), with the 
assurance that the eigenfunctions of H which are not of the form 
(9) are superpositions of eigenfunctions of this form and can be 
obtained later by taking linear combinations of the latter. This 
assertion constitutes the one-dimensional form of Bloch's theorem.⁷ 

An example of a potential lattice will be taken up in the next 
section. 

Exercises 

1. Explain why the stationary character of the distribution-in-n com- 
puted in Exercise 4" is not unexpected. 

2. If αβ = −βα ≠ 0, then the operators α and β are said to anticommute. 
Show that the operators Ω and d/dx anticommute and that the operators 
Γ and d/dx commute. 

3. Verify that every momentum state is an eigenstate of Γ. This result 
is related to that of Exercise 4’. 

4. Consider a free particle, note that H and Ω comprise a set of commuting 
constants of motion, and verify the expansion theorem insofar as 
this set is concerned. 

5. Consider a free particle, note that H, p, Γ (Γ with an arbitrary l) 
comprise a set of commuting constants of motion, and verify the expansion 
theorem insofar as this set is concerned. 

49. Potential Lattices and Energy Bands 

An electron in a crystal is subject to the fields of the regularly 
spaced atoms or ions comprising the crystal, and thus moves in a 
three-dimensional potential lattice; the quantum theory of the 
motion of a particle in a three-dimensional lattice is consequently 
of considerable interest. It turns out that some of the characteristics 
of such a motion show up even in the hypothetical 
case of a one-dimensional lattice, to which we shall restrict our 
attention here. 

Let us then consider a one-dimensional potential lattice of period 
l, so that 

$V(x + l) = V(x)$, (1)

and look for the energy levels of the system, that is, the values of 
the number E for which the Schroedinger equation Hψ = Eψ or, 
more explicitly, the equation 

$\psi'' + \kappa[E - V(x)]\psi = 0$, (2)


⁷ F. Bloch, Zeits. f. Physik, 52, 555 (1928). See also Floquet's theory of 
differential equations with periodic coefficients; for example, in Ince. 





has well-behaved solutions. Now, having assumed the correctness 
of the expansion theorem, we may restrict our attention to 
ψ's of the form 

(9⁴⁸) $\psi = e^{ikx}v$, $v(x + l) = v(x)$, (3)

where k is a real constant and v is a well-behaved periodic function 
of x of period l. Substituting (3) into (2), we find that, in order 
that (3) be a solution of (2), v must satisfy the differential equation 

$v'' + 2ikv' + \kappa\left[E - V(x) - \dfrac{k^2}{\kappa}\right]v = 0$. (4)

Thus the expansion theorem permits us to look for periodic well-behaved 
solutions of (4), rather than for (periodic or not) well-behaved 
solutions of (2). To be sure, Equation (4) is somewhat more 
complicated than Equation (2); but the permissibility of considering 
only the periodic solutions of (4) usually more than 
offsets this. 
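The step from (2) to (4) may be spelled out as follows. This is a routine check added for convenience; it uses only the symbols of (2) and (3), with primes denoting differentiation with respect to x.

```latex
\begin{align*}
\psi   &= e^{ikx}\,v, \\
\psi'  &= e^{ikx}\,(v' + ikv), \\
\psi'' &= e^{ikx}\,(v'' + 2ikv' - k^{2}v), \\
\psi'' + \kappa\,[E - V(x)]\,\psi
       &= e^{ikx}\Bigl\{\,v'' + 2ikv'
          + \kappa\Bigl[E - V(x) - \frac{k^{2}}{\kappa}\Bigr]v\Bigr\}.
\end{align*}
```

Since $e^{ikx}$ never vanishes, the left side of the last line is zero if and only if the expression in braces is zero, and the latter statement is (4).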

A rectangular lattice. For a periodic V(x) of actual physical 
interest, the treatment of (4) is complicated at best, and we shall 
turn at once to the idealized lattice of Fig. 74, consisting of an 
infinite sequence of rectangular potential wells of width b separated 
by potential hills of width c and height V₀; the period of this 
lattice is 

$l = b + c$. (5)

The well whose left edge lies at x = nl (where n = ···, −2, 
−1, 0, 1, 2, ···) will be called the nth well; the hill whose left 
edge lies at x = nl − c will be called the nth hill. 

Fig. 74. A rectangular potential lattice. 

Energies smaller than V₀. We begin with the case 

$0 < E < V_0$, (6)

and, adopting the abbreviations 

$\alpha = \sqrt{\kappa E}$, $\beta = \sqrt{\kappa(V_0 - E)}$, (7)

write (4) separately for a well and a hill, getting the equations 

$v'' + 2ikv' + (\alpha^2 - k^2)v = 0$ (a well) (8)

$v'' + 2ikv' - (\beta^2 + k^2)v = 0$ (a hill), (9)

and then find the respective general solutions of (8) and (9), getting 

$v = A_n e^{i(\alpha - k)x} + B_n e^{-i(\alpha + k)x}$ (the nth well) 

$v = C_n e^{(\beta - ik)x} + D_n e^{-(\beta + ik)x}$ (the nth hill), (10)

where the A's, B's, C's, and D's are arbitrary constants. 

We now proceed to adjust v so as to make it periodic with 
period l. Let x be a point within the 0th well and x + nl the 
corresponding point in the nth well. The periodicity of v is then 
assured insofar as the wells are concerned if 

$A_0 e^{i(\alpha - k)x} + B_0 e^{-i(\alpha + k)x} = A_n e^{i(\alpha - k)(x + nl)} + B_n e^{-i(\alpha + k)(x + nl)}$ (11)

for all values of x between 0 and b, that is, if 

$A_n = A_0 e^{-i(\alpha - k)nl}$, $B_n = B_0 e^{i(\alpha + k)nl}$. (12)

A similar consideration of the hills yields the equations 

$C_n = C_0 e^{-(\beta - ik)nl}$, $D_n = D_0 e^{(\beta + ik)nl}$. (13)

The requirement of periodicity, resulting in (12) and (13), thus 
reduces the number of arbitrary constants in (10) from infinity to 
just four, say A₀, B₀, C₀, and D₀. 

Our next task is to adjust A₀, B₀, C₀, and D₀ so that v is well-behaved. 
Since v is finite for any finite value of x and is periodic, 
it automatically remains finite when |x| → ∞. The conditions 
for continuity of v and v' at x = 0 are, according to (10), 

$A_0 + B_0 = C_0 + D_0$ (14)

$i(\alpha - k)A_0 - i(\alpha + k)B_0 = (\beta - ik)C_0 - (\beta + ik)D_0$. (15)

The conditions for continuity of v and v' at x = b are, according 
to (10) and (13), 

$e^{i(\alpha - k)b}A_0 + e^{-i(\alpha + k)b}B_0 = e^{-(\beta - ik)c}C_0 + e^{(\beta + ik)c}D_0$ (16)

$i(\alpha - k)e^{i(\alpha - k)b}A_0 - i(\alpha + k)e^{-i(\alpha + k)b}B_0 = (\beta - ik)e^{-(\beta - ik)c}C_0 - (\beta + ik)e^{(\beta + ik)c}D_0$. (17)





The existence of well-behaved periodic v's thus hinges on the 
consistency of the four equations (14) to (17). The condition for 
consistency can be found, for example, by expressing A₀ and B₀ 
in terms of C₀ and D₀ by means of (14) and (15), then eliminating 
A₀ and B₀ from (16) and (17), and finally requiring that the 
dependence of C₀ on D₀ given by (16) be the same as that given 
by (17). The procedure is tedious in any case, but the result is 
relatively simple: Equations (14) to (17) have simultaneous 
solutions A₀, B₀, C₀, and D₀ (other than the trivial A₀ = B₀ = 
C₀ = D₀ = 0) if and only if 

$\cosh \beta c \, \cos \alpha b + \dfrac{\beta^2 - \alpha^2}{2\alpha\beta} \sinh \beta c \, \sin \alpha b = \cos lk$. (18)

We now recall that k in (3) is a real constant, so that the right 
side of (18) is a real number lying at or between −1 and 1; Equation 
(18) is consequently compatible with the requisite reality of 
k if and only if 

$-1 \le \cosh \beta c \, \cos \alpha b + \dfrac{\beta^2 - \alpha^2}{2\alpha\beta} \sinh \beta c \, \sin \alpha b \le 1$. (19)

The new condition (19) involves the undetermined parameter E 
(besides the prescribed constants m, h, b, c, and V₀) and is in fact 
the condition that an E lying below V₀ must satisfy in order to be a 
possible energy value of our system. 
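The tedious elimination leading to (18) can be bypassed in a numerical check: the four homogeneous equations (14) to (17) have a nontrivial solution exactly when the determinant of their coefficients vanishes. The following is a minimal sketch in Python, assuming the forms of (14) to (17) written above; the numerical values of α, β, b, and c are illustrative choices made here and are not taken from the text.

```python
import cmath
import math

def lattice_function(alpha, beta, b, c):
    """Left-hand side of (18), valid for 0 < E < V0."""
    return (math.cosh(beta * c) * math.cos(alpha * b)
            + (beta**2 - alpha**2) / (2.0 * alpha * beta)
            * math.sinh(beta * c) * math.sin(alpha * b))

def det(m):
    """Determinant by cofactor expansion along the first row (adequate for 4 x 4)."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def continuity_matrix(alpha, beta, k, b, c):
    """Coefficients of (A0, B0, C0, D0) in (14), (15), (16), (17), all terms on one side."""
    i, e = 1j, cmath.exp
    wa, wb = e(i * (alpha - k) * b), e(-i * (alpha + k) * b)    # well factors at x = b
    hc, hd = e(-(beta - i * k) * c), e((beta + i * k) * c)      # hill factors at x = b
    return [
        [1, 1, -1, -1],
        [i * (alpha - k), -i * (alpha + k), -(beta - i * k), beta + i * k],
        [wa, wb, -hc, -hd],
        [i * (alpha - k) * wa, -i * (alpha + k) * wb,
         -(beta - i * k) * hc, (beta + i * k) * hd],
    ]

# Illustrative values only (arbitrary units).
alpha, beta, b, c = 5.0, 3.0, 1.0, 0.1
period = b + c

f_value = lattice_function(alpha, beta, b, c)   # lies between -1 and 1 for these values
k_allowed = math.acos(f_value) / period         # a real k satisfying (18)

det_on = det(continuity_matrix(alpha, beta, k_allowed, b, c))
det_off = det(continuity_matrix(alpha, beta, k_allowed + 0.7, b, c))
```

For a k satisfying (18) the determinant vanishes to rounding accuracy, while for a k that violates (18) it does not; this is a spot check at one set of values, not a substitute for the algebraic proof.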

Numerical example. To illustrate the type of limitation that 
(19) imposes on E, we turn to a numerical example, set 

Fo6‘'k = 144, c = ^6, (20) 

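and let 

$E/V_0 = r$, (21)

so that (19) becomes 

$-1 \le \left[\cosh\left(\dfrac{12c}{b}\sqrt{1 - r}\right)\cos 12\sqrt{r} + \dfrac{1 - 2r}{2\sqrt{r(1 - r)}}\sinh\left(\dfrac{12c}{b}\sqrt{1 - r}\right)\sin 12\sqrt{r}\right] \le 1$, (22)

where the factor 12c/b has the numerical value fixed by (20). 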

The choice V₀b²κ = 144 is the same as that which we made in 
Exercise 3® while studying numerically a single rectangular 
potential well. 

The bracketed term in (22) turns out to be the oscillating 
function of r shown graphically in Fig. 75. We note that for 
r < .03 (that is, for E < .03 V₀) the curve lies above 1; values 
of E lying below .03 V₀ thus fail to satisfy (22) and hence 
are not possible energy values of our system. It is also seen 
from the graph, however, that (22) does hold for all r's lying 
in the interval from about .03 to about .06, indicated in Fig. 
75 by a thick horizontal line; hence all E's lying in the 
interval from about .03 V₀ to about .06 V₀ are possible energy 
values of the system. For r's just greater than .06, the curve 
drops below −1, so that (22) does not hold, and consequently 
energies just greater than .06 V₀ are not possible; but r's lying 
in the interval from about .13 to about .25 satisfy (22) once 
again, so that all E's lying in the interval from about .13 V₀ to 
about .25 V₀ are possible energy values. Fig. 75 implies, in 
fact, that the possible energies of our system lying below V₀ 
form four bands, spaced approximately as follows: 

first band: .03 V₀ < E < .06 V₀ 
second band: .13 V₀ < E < .25 V₀ 
third band: .33 V₀ < E < .57 V₀ (23) 
fourth band: .64 V₀ < E < .99 V₀. 

Fig. 75. Graph of the bracketed function in (22). 

Any E lying within one of these four bands is a possible energy 
value, so that within a band the energy spectrum is continuous; 
but the consecutive ‘allowed’ bands are separated by ‘forbidden’ 
bands, that is, by continuous stretches of impossible energies. 

The result (23) is shown graphically on the right-hand side of 
Fig. 76, where the allowed energy bands are shaded; the horizontal 
lines on the left-hand side of this figure are the discrete energy 
levels of a single well for which V₀b²κ = 144, taken from Fig. 31®*. 
Note that the substitution of an infinite periodic sequence of 
rectangular wells for a single well of a similar shape causes each 
of the discrete levels of the single well to spread out into a continuous 
band. 
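The band structure read off the graph can be explored by scanning the bracketed function of (19) over 0 < r < 1 and recording where its magnitude does not exceed 1. The following is a minimal sketch in Python. It uses V₀b²κ = 144, as in the text, but the ratio c/b is an assumption made here purely for illustration, since the value printed in (20) is not legible in this copy; the band edges it produces therefore need not coincide with those listed above.

```python
import math

def bracket(r, c_over_b, strength=12.0):
    """Bracketed function of (19) for E = r * V0, with strength = sqrt(V0 b^2 kappa)."""
    s = strength * c_over_b * math.sqrt(1.0 - r)    # equals beta * c
    t = strength * math.sqrt(r)                     # equals alpha * b
    return (math.cosh(s) * math.cos(t)
            + (1.0 - 2.0 * r) / (2.0 * math.sqrt(r * (1.0 - r)))
            * math.sinh(s) * math.sin(t))

def allowed_bands(c_over_b, n=20000):
    """Scan 0 < r < 1 and return [(r_low, r_high), ...] where |bracket| <= 1."""
    bands, start, prev = [], None, None
    for i in range(1, n):
        r = i / n
        ok = abs(bracket(r, c_over_b)) <= 1.0
        if ok and start is None:
            start = r                      # a band opens
        if not ok and start is not None:
            bands.append((start, prev))    # a band closes at the last allowed r
            start = None
        prev = r
    if start is not None:
        bands.append((start, prev))
    return bands

C_OVER_B = 1.0 / 16.0    # ASSUMED ratio c/b, for illustration only
bands = allowed_bands(C_OVER_B)
```

Each entry of `bands` is a pair of r values bounding one allowed band below V₀; changing `C_OVER_B` shows how thinner hills widen the bands and thicker hills narrow them toward the discrete levels of a single well.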


Energies greater than V₀. Next we take up the case 

$E > V_0$, (24)

when β in (7) is the pure imaginary quantity β = iγ, where 

$\gamma = \sqrt{\kappa(E - V_0)}$. (25)

Fig. 76. The discrete levels for a rectangular well and the corresponding 
bands for a rectangular lattice. 

The structure of equations (8) to (19) is the same as before, and 
the correction for the fact that β is imaginary can be made directly 
in (19). Substituting iγ for β in (19) and recalling that 
cosh iζ = cos ζ and sinh iζ = i sin ζ, we get the new condition 

$-1 \le \cos \gamma c \, \cos \alpha b - \dfrac{\alpha^2 + \gamma^2}{2\alpha\gamma} \sin \gamma c \, \sin \alpha b \le 1$. (26)

For the particular numerical values (20) the condition (26) 
becomes 

$-1 \le \left[\cos 12\sqrt{r} \, \cos\left(\dfrac{12c}{b}\sqrt{r - 1}\right) + \dfrac{1 - 2r}{2\sqrt{r(r - 1)}} \sin 12\sqrt{r} \, \sin\left(\dfrac{12c}{b}\sqrt{r - 1}\right)\right] \le 1$, (27)

where the factor 12c/b has the numerical value fixed by (20). 

Fig. 77. Graph of the bracketed functions in (22) and (27). 

The bracketed term in (27) is plotted at the right of r = 1 in Fig. 
77, the left part of which is a reproduction of Fig. 75. We note 







that the energy spectrum of our system lying above V₀ consists 
(just as in the case E < V₀) of continuous allowed bands separated 
by continuous forbidden bands. The lower part of the complete 
energy spectrum of our system is shown on the right-hand side of 
Fig. 78, where the allowed bands are shaded; the left-hand side of 
this figure, a reproduction of Fig. 31^^, shows the lower part of 
the complete energy spectrum of a single well for which 
V₀b²κ = 144. Note, in particular, that the substitution of an 
infinite periodic sequence of rectangular wells for a single well 
of a similar shape causes the continuous energy spectrum of the 
single well to break up into bands. 

Concluding remarks. We have shown above with the help of the 
expansion theorem that the possible energies of a particle moving 
in the potential lattice of Fig. 74 are E's lying below V₀ and 
satisfying (19) and E's lying above V₀ and satisfying (26), and that 
in the special numerical case (20) the energy spectrum consists of 
the bands illustrated in Fig. 78. A more thorough treatment 
of the rectangular lattice would include a study of the dependence 
of the energy bands on the relative widths of the wells and the 
hills, a computation of the ψ-functions for the energy states, a 
study of the role of the parameter k and of the relation between 
k and E, and so on.⁸ 

Energy spectra composed of alternating allowed and forbidden 
bands are characteristic of potential lattices;⁹ the case V(x) = 
constant is of course an exception. The bands for a sinusoidal 
potential lattice are illustrated¹⁰ (for a particular choice of m and 
of the amplitude and period of V) in Fig. 79(b), where V is shown 
by a thick line and the allowed bands are shaded; Fig. 79(a) shows 
the discrete and the continuous spectra for a single sinusoidal well. 

Fig. 78. Energy spectra for a rectangular well and for a rectangular lattice. 

⁸ R. de L. Kronig and W. G. Penney, Proc. Roy. Soc., 130A, 499 (1931), 
considered an idealization of the lattice of Fig. 74 and derived a number of 
results in a closed form. 

⁹ H. A. Kramers, Physica, 2, 483 (1935). 

¹⁰ P. M. Morse, Phys. Rev., 35, 1310 (1930). 



Fig. 79. Energy spectra for a sinusoidal well and a sinusoidal lattice. 

We must emphasize that our discussion deals with lattices that 
are strictly periodic and thus extend from x = −∞ to x = ∞. 
In physical problems concerning not only the interior but also the 
surface of a crystal it is necessary to take into account the finite 
extent of the actual lattice; similarly, the computations must be 
modified whenever it is desired to take into account the imperfections 
of the crystal. These matters are discussed in writings 
on the theory of the solid state.¹¹ 

Exercises 

1. Show that, if c → 0 while b and V₀ are held fixed, then (19) and (26) 
go over into E > 0; if b → 0 while c and V₀ are held fixed, then (19) and (26) 
go over into E > V₀; if V₀ → ∞ while b and c are held fixed, then (19) 
goes over into 5^° with l = b; if c → ∞ while V₀ and b are held fixed, then 
(19) goes over into 19*® with a = ½b. Explain why these limiting forms of 
(19) and (26) are not unexpected. 

2. Consider large values of E and use (26) to show that as E increases 
the allowed bands become wider and the forbidden bands narrower. 

3. Just as the substitution of a periodic sequence of wells for a single 
well results in a spreading of each discrete level of a single well into a 
band, so the substitution of a finite sequence of wells for a single well 
results in a splitting of each discrete level of a single well into a multiplet 
of levels. To illustrate: consider a double well consisting of two 
rectangular wells each of width b and depth V₀, whose adjacent edges are 
separated from each other by a distance c; and show without detailed 
analysis that, if c is sufficiently large to make $\exp[-\tfrac{1}{2}\sqrt{\kappa(V_0 - E)}\,c]$ 
much smaller than $\exp[\tfrac{1}{2}\sqrt{\kappa(V_0 - E)}\,c]$, then the discrete energy spectrum 
consists of a series of doublets (pairs of levels), the separation of the 
members of a doublet being small compared to the separations of the 
doublets, and the positions of the doublets being approximately those of 
the singlet levels of a single rectangular well of width b and depth V₀. 

¹¹ For an introductory survey and many references see F. Seitz and 
R. P. Johnson, Jour. Applied Phys., 8, 84-97, 186-199, and 246-260 (1937). 





Chapter VIII 


REMARKS ON THE MOMENTUM METHOD 

50. The Framework of the Momentum Method 

The present short chapter marks an important turning point in 
our study, for, having discussed the Schroedinger method in some 
detail, we now begin to consider methods other than Schroedinger's. 
It should perhaps be emphasized that the work of this 
chapter might have, in principle, preceded our study of the 
Schroedinger method. The reader will recall that the operator 
assumption I and the quantum condition II of §13, as well as the 
operand rule IV of §17, are quite noncommittal concerning the 
specific form of the operators and operands to be employed; in 
the light of these general assumptions, the Schroedinger method 
rests, in fact, on a rather special possible choice of operators and 
operands, a choice, however, of the greatest all-round usefulness. 

A characteristic of the one-dimensional Schroedinger method is 
the prominence that it gives to the coordinate x (this method may, 
in fact, be called the x-method); the Schroedinger operands, for 
example, are functions of x (and of t) interpreted as amplitudes of 
distributions-in-x. The so-called momentum method or p-method, 
which we shall now outline briefly for the case of one dimension, 
gives the corresponding prominence to the momentum p, and the 
operands used in this method are functions of p (and of t) interpreted 
as amplitudes of distributions-in-p. 

The operators of interest to us now are those that can operate 
on functions of p, the fundamental ones being p (multiplication of 
an operand by p), ∂/∂p (differentiation of an operand partially 
with respect to p), and c (multiplication of an operand by the 
numerical constant c). The algebra of these operators is of course 
essentially the same as that of the operators x, ∂/∂x, and c. For 
example, since for an arbitrary differentiable u(p) we have 

$\dfrac{\partial}{\partial p}\,p\,u(p) = p\,\dfrac{\partial}{\partial p}u(p) + u(p) = \left[p\,\dfrac{\partial}{\partial p} + 1\right]u(p)$, (1)


277 



278 THE MOMENTUM METHOD §50 


we find, recalling the definition of equality of operators, that 
(∂/∂p)p = p(∂/∂p) + 1, that is, that 

$\dfrac{\partial}{\partial p}\,p - p\,\dfrac{\partial}{\partial p} = 1$. (2)


Equation (2) is analogous to 30^. The reader will perhaps readily 
see that were the symbol p substituted for the symbol x throughout 
Chapter I, the mathematical content of that chapter would not be 
altered. 
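The operator relation (∂/∂p)p − p(∂/∂p) = 1 can be verified exactly on polynomial operands. The following is a minimal sketch in Python; the representation of a polynomial by its list of coefficients, and the sample polynomial itself, are conveniences introduced here.

```python
def d_dp(coeffs):
    """Differentiate a polynomial [c0, c1, c2, ...], where c_n multiplies p**n."""
    return [n * c for n, c in enumerate(coeffs)][1:] or [0]

def times_p(coeffs):
    """Multiply a polynomial by p."""
    return [0] + list(coeffs)

def subtract(a, b):
    """Coefficient-wise difference a - b, padding the shorter list with zeros."""
    n = max(len(a), len(b))
    a = list(a) + [0] * (n - len(a))
    b = list(b) + [0] * (n - len(b))
    return [x - y for x, y in zip(a, b)]

u = [5, -3, 0, 2, 7]    # hypothetical operand u(p) = 5 - 3p + 2p^3 + 7p^4

# (d/dp) p u  -  p (d/dp) u, which should reproduce u itself.
commutator_u = subtract(d_dp(times_p(u)), times_p(d_dp(u)))
```

Since every term pⁿ is reproduced with unit coefficient, the check extends by linearity to any polynomial operand.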

We shall be concerned with eigenvalues and eigenfunctions of 
operators constructed from p, ∂/∂p, and c, and consequently a 
definition of a well-behaved function of p is required. The 
definition is analogous to that in the case of functions of x: u(p) 
is well-behaved if it is not identically zero, if it remains finite when 
|p| → ∞ (that is, when p → −∞ and also when p → ∞) and 
if it is continuous and has an adequate number of continuous 
derivatives; we shall not consider the continuity-conditions in 
detail; insofar as the work of this chapter is concerned it is 
sufficient to require u(p) and du(p)/dp to be continuous, that is, to 
impose on the operands the standard continuity conditions of §3. 
The definitions of the terms ‘eigenvalue’ and ‘eigenfunction’ need 
not be reworded, it being understood, of course, that if the operator 
α involves the operators p, ∂/∂p, or both, then u in the equation 

$\alpha u = \lambda u$ (3)

is usually a function of p. 

Our next task is to identify the operators, which we denote as 
usual by x and p, to be associated in the p-scheme with the 
dynamical variables x (the coordinate) and p (the momentum), 
respectively. The operators x and p must, in any scheme of 
association, satisfy the relation 

$xp - px = i\hbar$, (4)

and we note, consulting (2), that the associations 

dynamical variable x → operator $i\hbar\,\partial/\partial p$ 
dynamical variable p → operator p (5)

are consistent with (4); further, the eigenvalues of p and iħ∂/∂p 
are all real (Exercises 1 and 2), and consequently the associations 





(5) are suitable in every respect. These associations are the 
fundamental ones of the p-scheme, and are to be contrasted with 
the fundamental associations 3^^ of the Schroedinger method. 

For a dynamical variable that is the sum of a function of x 
(the coordinate) only and a function of p (the momentum) only, 
we then adopt the association 

dynamical variable α(x) + β(p) → operator $\alpha\left(i\hbar\,\dfrac{\partial}{\partial p}\right) + \beta(p)$, (6)

which is to be contrasted with 11^^. In particular, the one-dimensional 
Hamiltonian 

$H = \dfrac{1}{2m}p^2 + V(x)$ (7)

is associated in the p-method with the operator 

$H = \dfrac{1}{2m}p^2 + V\left(i\hbar\,\dfrac{\partial}{\partial p}\right)$, (8)

where V(iħ∂/∂p) is the same function of iħ∂/∂p as V(x) is of x. 
The operator (8) is to be contrasted with our familiar Schroedinger 
operator 18^^; note that the Hamiltonian operator is 
usually more complicated in the p-scheme than in the Schroedinger 
scheme. 

According to the operand rule of §17, the operands of the 
p-method are to be associated with states of dynamical systems. 
In computations carried out entirely by means of the p-method, 
it is convenient to denote the operands by ψ (with the understanding 
that ψ is a function of p and t, rather than of x and t) 
and to call them ψ-functions, as we shall do in this chapter; but 
whenever the Schroedinger method and the p-method are used 
concurrently it is advantageous to adopt the notation described 
at the end of §27 and to denote the Schroedinger ψ-functions by 
(x | ), and the ψ-functions of the p-scheme by (p | ); a justification 
for this use of the symbol (p | ) will be given in Exercise 5. 

To complete the list of assumptions required for simple one-dimensional 
applications of the p-method we add the two following: 

IIIᵖ. Every operand ψ of the p-method pertaining to a one-dimensional 
system whose Hamiltonian is H depends on the time t 
in such a way that 

$H\psi = i\hbar\,\dfrac{\partial}{\partial t}\psi$, (9)

where H is the operator (8). 

Vᵖ. When a one-dimensional system is in the state specified in 
the p-method by the operand ψ, then the expected average, $\mathrm{av}_\psi\,\alpha$, 
of any dynamical variable α is given by the formula 

$\mathrm{av}_\psi\,\alpha = \dfrac{\int \bar{\psi}\,\alpha\,\psi\,dp}{\int \bar{\psi}\,\psi\,dp}$, (10)

where α in the integrand is the operator associated in the p-method 
with the dynamical variable α. 

Equation (9) has the same form as does the second Schroedinger 
equation 1^®, while (10), the expectation formula of the 
p-method in one dimension, resembles the one-dimensional 
Schroedinger expectation formula 2^^. 


Exercises 

1. Use the operator associated in the p-scheme with the coordinate x 
to show that all real (and no other) numbers are possible values of x, and 
that the eigenfunction belonging to the eigenvalue x′ of x is ψ = ψ(p) = 
$Ae^{-ix'p/\hbar}$. 

2. Use the operator associated in the p-scheme with the momentum p 
to show that all real (and no other) numbers are possible values of p, and 
that the eigenfunction belonging to the eigenvalue p′ of p is¹ ψ = ψ(p) = 
Aδ(p − p′). 

3. How can the distribution function of a dynamical variable for a 
specific state of a one-dimensional system be computed (at least in principle) 
within the framework of the p-method? 

4. Show that in the p-scheme the operands ψ and cψ, where c is a numerical 
constant, specify the same state. 

5. Take α in (10) to be a function of p only and show that, in the 
p-scheme, $\bar{\psi}\psi$ is the distribution-in-p for the state specified by ψ, so that 
ψ itself is an amplitude of this distribution. 

6. Working in the p-scheme, show that, if ψ is an eigenfunction of an 
operator α belonging to the eigenvalue α′ of α, then the result of a precise 
measurement of the dynamical variable α when the system is in the state 
ψ is certain to be α′. 

7. Let (x | ) denote an amplitude of the distribution-in-x for a state 
specified in the p-scheme by the ψ-function (p | ), and derive the result of 
Exercise without reference to the Schroedinger method. 

¹ As before, we regard δ-functions as well-behaved. 


51. Energy Levels of an Oscillator 

To illustrate the p-method, we shall now use it to compute the 
energy levels of a linear harmonic oscillator. The Hamiltonian 
being 

$H = \dfrac{1}{2m}p^2 + \dfrac{1}{2}kx^2$, (1)

we find, using 8⁵⁰, that in the p-scheme the operator H for the 
oscillator is 

$H = \dfrac{1}{2m}p^2 - \dfrac{1}{2}k\hbar^2\dfrac{\partial^2}{\partial p^2}$, (2)

so that our problem is to compute the eigenvalues of the operator 
(2), that is, to find the values of the number E for which the 
equation Hψ = Eψ, or 

$-\dfrac{1}{2}k\hbar^2\dfrac{d^2\psi}{dp^2} + \dfrac{1}{2m}p^2\psi = E\psi$, (3)

has well-behaved solutions; our operands are now functions of p. 
The substitutions 

$s = p/b$, $E = \tfrac{1}{2}h\nu_c\lambda$, (4a, b)

where 

$b = \sqrt[4]{km\hbar^2}$, $2\pi\nu_c = \sqrt{k/m}$, (5)

carry the differential equation (3) into 

$\dfrac{d^2\psi}{ds^2} + (\lambda - s^2)\psi = 0$, (6)

that is, into 3^; note that the abbreviations (5) are the same as 25^ 
and 14“. The eigenvalues of (6) are 

$\lambda_n = 2n + 1$, n = 0, 1, 2, 3, ···, (7)

so that, in view of (4b), the possible energies of the oscillator are 

$E_n = (n + \tfrac{1}{2})h\nu_c$, n = 0, 1, 2, 3, ···. (8)

The result (8) is identical with our old result 15*^. 
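The reduction of the oscillator equation of the p-scheme to the standard form ψ″ + (λ − s²)ψ = 0 can be written out step by step. The following lines are a sketch added for convenience; they assume the oscillator Hamiltonian p²/2m + ½kx² and the association of x with iħ ∂/∂p, and use the abbreviations b⁴ = kmħ² and 2πν_c = √(k/m).

```latex
\begin{align*}
p = bs \;&\Rightarrow\; \frac{d^{2}}{dp^{2}} = \frac{1}{b^{2}}\,\frac{d^{2}}{ds^{2}}, \\
b^{4} = km\hbar^{2} \;&\Rightarrow\;
   \frac{k\hbar^{2}}{b^{2}} = \frac{b^{2}}{m} = \hbar\sqrt{k/m} = h\nu_{c}, \\
-\tfrac{1}{2}k\hbar^{2}\,\frac{d^{2}\psi}{dp^{2}} + \frac{p^{2}}{2m}\,\psi = E\psi
\;&\Rightarrow\;
   \tfrac{1}{2}h\nu_{c}\Bigl(-\frac{d^{2}\psi}{ds^{2}} + s^{2}\psi\Bigr)
   = \tfrac{1}{2}h\nu_{c}\,\lambda\,\psi
\;\Rightarrow\;
   \frac{d^{2}\psi}{ds^{2}} + (\lambda - s^{2})\,\psi = 0 .
\end{align*}
```

With λ = 2n + 1 the relation E = ½hν_cλ then gives E = (n + ½)hν_c, in agreement with the result quoted above.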



We have thus found the energy levels of the oscillator by a 
method other than Schroedinger's. In the case of the oscillator, 
the steps of the p-method are quite similar to those of the Schroedinger 
method, because the Hamiltonian is quite symmetric in 
x and p. 

Exercises 

1. Compute the eigenfunctions of (3) and show that in the p-scheme 
the normalized ψ-function specifying the nth quantum state of an oscillator 
is 41®^, γₙ being an arbitrary real constant. 

2. Consider the normal state of the oscillator and, working entirely 
within the p-scheme, derive 26^®, 27^®, 18^ ^ and 19^®. 

3. Working entirely within the p-scheme, prove the Heisenberg in- 
equality 13^®. 

4. Use the p-method to compute the energy spectrum of a free particle. 

52. Concluding Remarks 

The p-method may be built up into a theory as comprehensive 
as that of Schroedinger; in particular, the equation of motion 
27*^ is found to hold in the p-scheme as well as in Schroedinger's, 
and the expansion theorem of §46 goes over into the p-scheme 
without a change of wording. But the p-scheme is on the whole 
not as convenient as Schroedinger's for handling the more usual 
types of problems. The main reasons for our mentioning it are 
that, at least in its more elementary aspects, it affords perhaps the 
simplest example of a method other than Schroedinger's, and that 
it enables us to emphasize the generality of the operator assumption 
I, the quantum condition II, and the operand rule IV without 
invoking operators other than differential operators. 

The questions now arise as to whether the p-scheme is certain 
to yield results in agreement with Schroedinger's, and whether it 
can be deduced directly from the Schroedinger method (that is, 
whether it can be constructed without replacing the assumptions 
III® and V® by other assumptions). The answers are in the 
affirmative, but the proofs, which involve a theorem on differential 
equations due to Laplace,² are beyond the scope of this book. 

The process of working with the p-method is often called working 
in momentum space; a ψ-function of the p-method is often called a 
momentum eigenfunction, a term not to be confused with the term 
eigenfunction of momentum, that is, a ψ-function specifying a state 
for which the momentum is a certainty. The term ‘p-representation’ 
is sometimes used for 

² Ince, page 187. 

It should perhaps be added that when the p-scheme is used in handling 
potential lattices the equation Hψ = Eψ assumes a rather interesting form. 
For example, let V(x) = V₀ cos ax, that is, let 

$V(x) = \tfrac{1}{2}V_0 e^{-iax} + \tfrac{1}{2}V_0 e^{iax}$, (1)

where V₀ and a are prescribed constants; according to 8⁵⁰, the operator 
associated with H in the p-scheme is then 

$H = \dfrac{1}{2m}p^2 + \tfrac{1}{2}V_0\,e^{a\hbar\,\partial/\partial p} + \tfrac{1}{2}V_0\,e^{-a\hbar\,\partial/\partial p}$. (2)

To find what happens to a function of p when the operator exp (aħ ∂/∂p) 
acts on it we expand exp (aħ ∂/∂p) formally into a power series and get 

$e^{a\hbar\,\partial/\partial p} = 1 + a\hbar\,\dfrac{\partial}{\partial p} + \dfrac{1}{2!}a^2\hbar^2\dfrac{\partial^2}{\partial p^2} + \dfrac{1}{3!}a^3\hbar^3\dfrac{\partial^3}{\partial p^3} + \cdots$, (3)

an equation which implies that 

$e^{a\hbar\,\partial/\partial p}\,\psi(p) = \psi(p) + a\hbar\,\psi'(p) + \tfrac{1}{2}a^2\hbar^2\psi''(p) + \cdots$, (4)

where the primes denote differentiation with respect to p; now, by Taylor's 
theorem, 

$\psi(p) + a\hbar\,\psi'(p) + \tfrac{1}{2}a^2\hbar^2\psi''(p) + \cdots = \psi(p + a\hbar)$, (5)

and, waiving mathematical caution, we conclude that 

$e^{a\hbar\,\partial/\partial p}\,\psi(p) = \psi(p + a\hbar)$, (6)

that is, that operating on ψ(p) with the operator exp (aħ ∂/∂p) results in 
replacing p in ψ(p) by p + aħ. In a similar way 

$e^{-a\hbar\,\partial/\partial p}\,\psi(p) = \psi(p - a\hbar)$. (7)

Thus when H is (2) the equation Hψ = Eψ is 

$\dfrac{1}{2m}p^2\psi(p) + \tfrac{1}{2}V_0\,\psi(p + a\hbar) + \tfrac{1}{2}V_0\,\psi(p - a\hbar) = E\psi(p)$. (8)

Note that (8), an equation to be solved for ψ and E under the condition 
that ψ be well-behaved, is not a differential equation; equations of type 
(8) are called difference equations. 
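The statement that exp (aħ ∂/∂p) replaces p by p + aħ can be tested exactly on polynomials, for which the formal power series terminates. The following is a minimal sketch in Python; the sample polynomial, the point p, and the number `step` (which plays the part of aħ) are arbitrary choices made here.

```python
from math import factorial

def poly_eval(coeffs, p):
    """Evaluate c0 + c1 p + c2 p^2 + ... at the point p."""
    return sum(c * p**n for n, c in enumerate(coeffs))

def poly_derivative(coeffs):
    """Coefficients of the derivative; an empty list stands for the zero polynomial."""
    return [n * c for n, c in enumerate(coeffs)][1:]

def shift_by_series(coeffs, p, step):
    """Sum of step**n / n! times the nth derivative at p; finite for a polynomial."""
    total, deriv, n = 0.0, list(coeffs), 0
    while deriv:
        total += step**n / factorial(n) * poly_eval(deriv, p)
        deriv = poly_derivative(deriv)
        n += 1
    return total

psi = [1.0, -2.0, 0.5, 3.0]    # hypothetical psi(p) = 1 - 2p + 0.5 p^2 + 3 p^3
p, step = 0.4, 0.25            # step stands for the product a * hbar

shifted = shift_by_series(psi, p, step)    # the series applied to psi, evaluated at p
direct = poly_eval(psi, p + step)          # psi evaluated at p + a*hbar
```

For functions that are not polynomials the series need not terminate, which is where the mathematical caution waived in the text would have to be restored.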





Exercises 

1. Starting with the result found in the paragraph following 4*®, show 
that, if V(x) is even in x, then every eigenfunction of H in the p-scheme is 
either even or odd in p, or is a superposition of an even-in-p and an 
odd-in-p eigenfunction of H. 

2. Let Ωₚ denote the operation of replacing p by −p, so that Ωₚf(p) = 
f(−p) for every f(p). Use the operator Ωₚ and the expansion theorem to 
derive the result of Exercise 1 without reference to the Schroedinger 
method. 



Chapter IX 

LINEAR OPERATORS AND MATRICES 

We have by now considered two distinct methods of associating 
specific operators with dynamical variables: the Schroedinger 
method and the p-method. The next method on our program is 
that of Heisenberg, which employs operators quite different from 
those that we used before, and associates dynamical variables with 
matrices. The present chapter is devoted to a preliminary 
mathematical discussion, which is intended for a reader whose 
knowledge of matrices is at best vague, and which consequently 
begins in a rather elementary way. The more essential parts of 
this chapter are listed in §61. 

If the reader finds any difficulties in connection with the elements 
of the theory of simultaneous linear algebraic equations 
encountered in this chapter, he should consult a book on algebra. 

53. Mappings of a Plane 

Vectors and rays. Let the various points of a plane, say the 
plane of Fig. 80, be denoted by 0, u, v, w, and so on. With every 
point of the plane we associate a vector with a terminus at the 
point in question and with an origin at 0; the point and the 
corresponding vector will be denoted by the same symbol, and the 
terms point and vector will be used interchangeably. The symbol 
cu, where c is a real number, denotes the vector whose length is 
|c| times that of u, and whose direction is the same as that of u 
if c is positive and opposite if c is negative. The vector whose 
origin and terminus are both at 0 is called the vector zero and is 
denoted by 0. The locus of the termini of all vectors of the form 
cu, where u is fixed but c differs from vector to vector, forms a 
ray; a ray can be specified by a single nonvanishing vector lying 
on it, and may be pictured as 

286 


OPERATORS AND MATRICES 


§53 


an infinite straight line through 0. If a ray contains a nonvanishing 
vector that is perpendicular to a nonvanishing vector lying 
on another ray, the two rays are said to be (mutually) orthogonal. 
The symbol u + v denotes the vector sum of u and v, that is, 
the vector forming, in the usual sense, the diagonal of the parallelogram 
with vector sides u and v. The methods of this section will 
be entirely geometric, so that, for example, to specify a point or a 
line in our plane, we must mark it, and to add two given vectors 
we must use ruler and compasses. 

By definition, n vectors, $u_{I}, u_{II}, \ldots, u_{(n)}$, are said to be
linearly independent if it is impossible to choose n constants,
$c_1, c_2, \ldots, c_n$, not all of which are zeros, in such a way as to
satisfy the equation

$$c_1 u_{I} + c_2 u_{II} + \cdots + c_n u_{(n)} = 0; \tag{1}$$

otherwise, the n vectors are said to be linearly dependent. The
reader will convince himself that any three vectors lying in a plane
are linearly dependent, and that two vectors lying in a plane are
linearly dependent if and only if they lie on the same ray.
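Although the methods of this section are geometric, the two statements just made can be checked numerically once coordinates are assumed. The following is a minimal sketch in Python, assuming each plane vector is given as a pair of numbers in some fixed coordinate frame; the function names are illustrative and do not come from the text.

```python
def det2(u, v):
    """Determinant of the array whose columns are the plane vectors u and v."""
    return u[0] * v[1] - u[1] * v[0]


def lie_on_same_ray(u, v):
    """True if u and v are linearly dependent, i.e. lie on one ray through 0."""
    return det2(u, v) == 0


def dependence_coefficients(u, v, w):
    """Return (c1, c2, c3), not all zero, with c1*u + c2*v + c3*w = 0."""
    c = (det2(v, w), det2(w, u), det2(u, v))
    if any(c):
        return c
    # All three determinants vanish, so u, v, w lie on one ray;
    # a relation between u and v alone then suffices.
    if u[0] == 0 and u[1] == 0:
        return (1, 0, 0)
    k = v[0] / u[0] if u[0] != 0 else v[1] / u[1]
    return (k, -1, 0)  # k*u - v = 0


def combine(coeffs, vectors):
    """Form the linear combination c1*u_I + c2*u_II + ... of Eq. (1)."""
    return tuple(
        sum(c * vec[i] for c, vec in zip(coeffs, vectors)) for i in range(2)
    )
```

For instance, `dependence_coefficients((1, 2), (3, 1), (-2, 5))` gives constants that are not all zero and that satisfy Equation (1), which illustrates the linear dependence of any three vectors in a plane.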

Mappings. A process of rearranging the points comprising a
plane is called a transformation or a mapping of the plane. The
following mapping, which we shall call the mapping τ, is a simple
example: we draw in our plane the mutually perpendicular lines a
and b shown in Fig. 81, and then move each point in the plane so
that it does not cross either a or b, so that its distance from a
is halved, and so that its distance from b is trebled; the point
$u_{I}$ in Fig. 81 is thus sent into the place $v_{I}$, the point $u_{II}$ into
the place $v_{II}$, and so on. To illustrate the effect of this mapping
perhaps more clearly, we draw in Fig. 82 a circle with center at 0
and also the ellipse into which the mapping τ sends the points of
this circle; the reader will verify that the specifically marked
points of the circle are sent, respectively, into the similarly
marked points of the ellipse.

Fig. 82. A circle and the ellipse into which the mapping τ sends
this circle.



Mappings will usually be denoted by Greek letters. The
symbolic equation

$$\alpha u = v, \tag{2}$$

read 'α times u equals v,' will mean that the mapping α sends the
vector u into the position occupied, prior to the mapping, by the
vector v; thus in the case of our special mapping τ we write
$\tau u_{I} = v_{I}$, $\tau u_{II} = v_{II}$, and so on, where $u_{I}$, $u_{II}$, $v_{I}$, and $v_{II}$
are the particular vectors so marked in Fig. 81.

If a mapping α sends each vector into a definite place, we
say that α is unambiguous. If all distinct vectors are sent by α
into distinct places (that is, if the inequality u ≠ v implies the
inequality αu ≠ αv) we say that α is nonsingular; otherwise
(that is, if there are some u and v such that u ≠ v but αu = αv)
we call α singular. If both equations

$$\alpha(cu) = c(\alpha u) \quad \text{and} \quad \alpha(u + v) = \alpha u + \alpha v, \qquad c,\ u,\ v \ \text{arbitrary}, \tag{3}$$

hold, we say that α is linear. The reader will verify that our
special mapping τ is unambiguous, linear, and nonsingular.
We shall use the term mapping for unambiguous linear mapping,
since only such mappings will be of interest to us; further, we shall
regard mappings as operators whose operands are vectors, and shall
use the terms mapping and operator interchangeably.

When our τ acts on a vector, it in general moves the vector from
one ray to another; for example, the vector $u_{I}$ of Fig. 81 and the
vector $\tau u_{I} = v_{I}$, into which τ sends $u_{I}$, lie on different rays.
But the vectors lying on the rays a and b behave in an exceptional
way; in fact, as is perhaps best seen from Fig. 82,

$$\tau u = 3u, \quad \text{if } u \text{ lies on } a \tag{4}$$

$$\tau u = \tfrac{1}{2}u, \quad \text{if } u \text{ lies on } b, \tag{5}$$

so that a vector lying on the ray a remains on this ray after the
mapping τ is carried out, and a vector lying on b remains on b.
If a nonvanishing vector u is sent by an operator α into a numerical
multiple of itself, that is, if

$$\alpha u = \lambda u, \tag{6}$$

where λ is a number, we call u an eigenvector of α, call λ an
eigenvalue of α, and say that the eigenvector u and the eigenvalue λ
of α belong to each other; this terminology, except for the omission
of references to boundary and continuity conditions, is parallel to
that of §3, and (6) has precisely the same form as $15^{3}$. Equation
(4) signifies, in these terms, that any nonvanishing vector lying on
a is an eigenvector of our mapping τ, belonging to the eigenvalue
3 of τ; similarly, Equation (5) signifies that any nonvanishing
vector lying on b is an eigenvector of our τ, belonging to the
eigenvalue ½ of τ. A ray containing an eigenvector of an operator
is called an eigenray or a principal axis of the operator; thus
the ray a is the eigenray of our τ belonging to the eigenvalue 3,
and b is the eigenray of our τ belonging to the eigenvalue ½;
any nonvanishing vector lying on an eigenray of a linear operator
is of course an eigenvector of the operator. It is perhaps
obvious on geometric grounds that the rays a and b are the only
eigenrays of τ, and that consequently the only eigenvalues of our τ
are the two, 3 and ½, found above. Any two eigenvectors of
τ belonging to the eigenvalue 3 are linearly dependent, and
hence this eigenvalue is said to be nondegenerate; similarly, the
eigenvalue ½ of τ is nondegenerate. This use of the term
'nondegenerate eigenvalue' is parallel to that of §7.
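The behavior of the mapping τ described by Equations (3) through (6) can be tried out numerically. The sketch below is only an illustration under an assumption that the text does not make at this stage: coordinates (x, y) are introduced with the x-axis along the line a and the y-axis along the line b, so that the distance of a point from a is |y| and its distance from b is |x|. The helper names are hypothetical.

```python
def tau(p):
    """Halve the distance from a (the x-axis) and treble that from b (the y-axis)."""
    x, y = p
    return (3 * x, y / 2)


def add(u, v):
    return (u[0] + v[0], u[1] + v[1])


def scale(c, u):
    return (c * u[0], c * u[1])


def on_same_ray(u, v):
    """True if u and v lie on one ray through the origin."""
    return u[0] * v[1] - u[1] * v[0] == 0


u, v = (1, 4), (-2, 6)

# Linearity, Eq. (3).
linear_sum = tau(add(u, v)) == add(tau(u), tau(v))
linear_scale = tau(scale(5, u)) == scale(5, tau(u))

# Eigenvectors, Eqs. (4) and (5).
on_a = tau((2, 0)) == scale(3, (2, 0))
on_b = tau((0, 4)) == scale(0.5, (0, 4))

# A vector lying on neither a nor b is moved to a different ray.
moved = not on_same_ray((1, 1), tau((1, 1)))
```

Under this choice of coordinates every vector on the x-axis is multiplied by 3 and every vector on the y-axis by ½, in agreement with (4) and (5).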

Further examples of mappings. Another simple mapping, to
be called γ, is the following: We draw in our plane the line A
shown in Fig. 83 and then reflect in A every point of the plane, so
that the point $u_{I}$ is sent to the place $u_{II}$, the point $u_{II}$ to the
place $u_{I}$, the point $u_{III}$ to the place $u_{IV}$, and so forth. To
illustrate the effect of γ more elaborately, we show in Fig. 84 the
plane, prior to the reflection, with an outline of a postcard (full
line) with center at 0 drawn on it; the position of the outline of
the postcard after the reflection is shown by the dotted line.¹ The
latter position is shown once again in Fig. 85.

Fig. 84. Diagram of the reflection γ.


¹ It should perhaps be emphasized that Fig. 84 is meant to illustrate
the reflection of the plane in the line A and not the rotation of the plane
through 180° about the line A as an axis; the rotation (which would display
the back of the postcard) would involve getting out into the third dimension,
while the transformations that we consider now are all carried out
within the plane. Similar remarks apply to the other reflections of the
plane mentioned below.

Fig. 85. Diagrams of eight mappings of a plane: 1, identical mapping;
γ, reflection in line A; δ, reflection in line B; ε, reflection in line C;
ζ, reflection in line D; η, clockwise 90° rotation about 0; ξ, counterclockwise
90° rotation about 0; −1, inversion in 0. The lines A, B, C, and
D are those of Fig. 86.

In general, γ shifts a vector from one ray to another, except
that (Fig. 84)

$$\gamma u = u, \quad \text{if } u \text{ lies on } A \tag{7}$$

$$\gamma u = -u, \quad \text{if } u \text{ lies on } B. \tag{8}$$

The eigenvalues of our γ are consequently 1 and −1, the eigenray
belonging to the eigenvalue 1 being A, and the eigenray belonging
to the eigenvalue −1 being B.

The effect of a reflection of the plane in the ray A, B, C, or D
of Fig. 86 is indicated in diagrams γ, δ, ε, and ζ of Fig. 85 by the
respective position into which it sends the outline of a postcard
whose initial position is shown by the full line in Fig. 84.

Fig. 86.

The mapping consisting in multiplying every vector of the plane
by a constant c is denoted by the symbol c. Each nonvanishing
vector is an eigenvector of this mapping and belongs to the single
eigenvalue c of the mapping. Every three, but not every two,
eigenvectors of the operator c are linearly dependent; for this
reason the single eigenvalue c of the operator c is said to be
doubly degenerate, this use of the term being parallel to that of §7.
Special cases of mappings of type c are the mapping 1 (the identical
mapping), which leaves every vector in its original position, and
the mapping −1 (the inversion in 0), which reverses the direction of
every vector without changing its length; these mappings are
illustrated in Fig. 85.

A rotation of every vector in the plane about 0 through the same
angle is another example of a mapping of the plane. The respective
effects of a clockwise 90° rotation and of a counterclockwise
90° rotation about 0 are illustrated in diagrams η and ξ of Fig. 85.
Since the mapping η (as well as the mapping ξ) rotates every
vector through 90° and thus removes every nonvanishing vector
from its original ray, this mapping has no eigenvectors and
eigenvalues when regarded from the standpoint of ordinary geometry;
but we shall presently generalize our viewpoint, and then both
η and ξ will turn out to have imaginary eigenvalues.

The eight mappings of Fig. 85 are of course rather special 
examples of mappings of a plane; but they are particularly simple, 
and we shall often use them for illustrations. 

Algebraic relations between mappings. In dealing with mappings,
we speak of their sums, products, and so on, the definitions
of equality, sum, and product of mappings being just those given
in §2 for operators in general. Operator-algebraic relations
between mappings can be illustrated by reference to Fig. 85.
We note, for example, that if an arbitrary vector is reflected in the
line C (operation ε) and the resulting vector is turned clockwise
through 90° (operation η), then the final vector is just that obtained
by reflecting the original vector in the line B (operation δ);
symbolically,

$$\eta\epsilon u = \delta u, \qquad u \ \text{arbitrary}, \tag{9}$$

and hence

$$\eta\epsilon = \delta. \tag{10}$$

Again, if an arbitrary vector is turned clockwise through 90°
(operation η) and the resulting vector is reflected in the line C
(operation ε), then the final vector is just that obtained by
reflecting the original vector in the line A (operation γ); symbolically,

$$\epsilon\eta u = \gamma u, \qquad u \ \text{arbitrary}, \tag{11}$$

and hence

$$\epsilon\eta = \gamma. \tag{12}$$

Since γ and δ are different operations, we conclude from (10) and
(12) that

$$\eta\epsilon \neq \epsilon\eta, \tag{13}$$

that is, that the mappings ε and η do not commute.

If an arbitrary vector is turned clockwise through 90° (operation η)
and the resulting vector is turned counterclockwise through
90° (operation ξ), then the final vector is just the original vector,
so that ξ, so to speak, undoes what η does; symbolically, ξηu = u
for an arbitrary u, and hence

$$\xi\eta = 1. \tag{14}$$

In a similar way we find that

$$\eta\xi = 1, \tag{15}$$

and comparison with (14) shows that η and ξ commute. Incidentally,
Equations (14) and (15) suggest writing

$$\eta = \xi^{-1} \quad \text{and} \quad \xi = \eta^{-1}, \tag{16}$$

and calling η and ξ the reciprocals of each other. An operator
may be its own reciprocal: the operators 1, γ, δ, ε, and −1 of
Fig. 85 are examples. It will be shown in Exercise 5 that some
operators have no reciprocals.

The geometric notion of mappings, discussed above with special 
reference to a plane, is readily extended to the case of a three- 
dimensional space or, as we shall say for short, a 3-space. Just 
as a nonsingular linear mapping of a plane in general sends a 
circle into an ellipse, so a nonsingular mapping of a 3-space in 
general sends a sphere into an ellipsoid. 





Exercises

1. Identify the respective eigenrays and eigenvalues of the mappings
δ, ε, and ζ of Fig. 85.

2. Verify Table 11, which comprises the multiplication table of the
operators of Fig. 85; the product αβ is recorded at the intersection
of the column labeled β and the row labeled α.

Table 11

       |   1    γ    δ    ε    ζ    η    ξ   −1
  -----+-----------------------------------------
    1  |   1    γ    δ    ε    ζ    η    ξ   −1
    γ  |   γ    1   −1    ξ    η    ζ    ε    δ
    δ  |   δ   −1    1    η    ξ    ε    ζ    γ
    ε  |   ε    η    ξ    1   −1    γ    δ    ζ
    ζ  |   ζ    ξ    η   −1    1    δ    γ    ε
    η  |   η    ε    ζ    δ    γ   −1    1    ξ
    ξ  |   ξ    ζ    ε    γ    δ    1   −1    η
   −1  |  −1    δ    γ    ζ    ε    ξ    η    1

3. Use Table 11 to show that (1 + γ)(1 − γ) = 0, (1 + γ)² =
2(1 + γ), εη − ηε = 2γ, and εη + ηε = 0, and verify these relations
by geometric construction. Show that, if $(\gamma\eta)^{-1}$ denotes the
reciprocal of γη, then [illegible].


4. Show geometrically that, if two eigenrays of a mapping of a plane 
coincide with two eigenrays of another mapping of the plane, then the two 
mappings commute. 

5. Show geometrically that a singular mapping has no reciprocal;
remember that we restrict ourselves to unambiguous mappings.

6. Show geometrically that a mapping having the eigenvalue 0 is singular,
and that every singular mapping has the eigenvalue 0.

7. Let $\alpha_1, \alpha_2, \alpha_3, \ldots$ be operators, and let $(\alpha_i\alpha_j)$ and $(\alpha_j\alpha_k)$ denote
for the moment the operators $\alpha_i\alpha_j$ and $\alpha_j\alpha_k$, respectively; the α's are then
said to obey the associative law for multiplication if $(\alpha_i\alpha_j)\alpha_k = \alpha_i(\alpha_j\alpha_k)$
for every i, j, and k. Verify that the operators of Fig. 85 obey this law.

8. Incidentally, a set of operators is said to form a group if all of the
following rather stringent conditions are satisfied: (a) the members of the
set obey the associative law for multiplication; (b) the set contains the
operator 1; (c) every member of the set has a reciprocal, and this reciprocal
is a member of the set; and (d) if α and β are any two members of the set,
then αβ, βα, and [illegible] are also members of the set. Verify that the eight
operators of Fig. 85 form a group, and that the four operators 1, η, ξ, and
−1 also form a group when taken by themselves.

9. Show geometrically that any set of four or more vectors in 3-space is 
linearly dependent. 

10. A certain mapping of 3-space consists in reflecting every point in a 
plane passing through 0; show geometrically that the eigenvalues of this 
mapping are —1 and 1, the first being nondegenerate and the second being 
doubly degenerate. 

11. Let σ be a reflection of 3-space in a plane S passing through 0,
and τ be a reflection of the 3-space in a plane T passing through 0. Show
geometrically that στ is a rotation about the line of intersection of the
planes S and T, and that στ = [illegible].





54. Representation of Mappings by Means of Matrices

The purely geometric methods of the preceding section become
unwieldy in the case of more complicated mappings, and we shall
now turn to the methods of analytic geometry. To do this, we
draw in our plane a rectangular coordinate frame with origin at 0,
say the frame P with axes $P_1$ and $P_2$, shown in Fig. 87, which are
supposed to lie along the lines C and D of Fig. 86; the basic unit
vectors of this frame will be denoted by $e_1^P$ and $e_2^P$. Any vector u
in the plane can be resolved in terms of $e_1^P$ and $e_2^P$, that is,
expressed in the form

$$u = u_1^P e_1^P + u_2^P e_2^P, \tag{1}$$

where $u_1^P$ and $u_2^P$ are two numbers, called the components of the
vector u in the frame P. The superscripts P in our symbols would
be unnecessary were it not for the fact that we shall presently
consider other coordinate frames and compare the results; to make
the notation unambiguous, we shall agree not to use the superscripts
P, Q, and R as exponents, that is, to indicate powers. Given the
coordinate frame, the components of u specify u completely. If
u = cv, where c is a constant, we have $u_1^P = c v_1^P$ and $u_2^P = c v_2^P$,
so that, in particular, if u = v, then $u_1^P = v_1^P$ and $u_2^P = v_2^P$.
If w = u + v, then $w_1^P = u_1^P + v_1^P$ and $w_2^P = u_2^P + v_2^P$. If the
vectors u and v are orthogonal (mutually perpendicular), then,
as the reader will verify, we have

$$u_1^P v_1^P + u_2^P v_2^P = 0. \tag{2}$$

The square of the length of a vector u is

$$(u_1^P)^2 + (u_2^P)^2. \tag{3}$$

Vectors of unit length are called unitary; pairs of unitary and
mutually orthogonal vectors are called orthonormal.

Fig. 87. The frame P.

Let us now consider the operator γ of the last section, namely,
the reflection of the plane in the line A, and find the relations
between the components $u_1^P$ and $u_2^P$ of an arbitrary vector u and
the components $v_1^P$ and $v_2^P$ of the vector v = γu, that is, the vector
into which γ sends the vector u. As seen from Fig. 88, $v_1^P = u_2^P$
and $v_2^P = u_1^P$, that is,

$$\begin{aligned} v_1^P &= 0 \cdot u_1^P + 1 \cdot u_2^P \\ v_2^P &= 1 \cdot u_1^P + 0 \cdot u_2^P. \end{aligned} \tag{4}$$

Equations (4) completely specify the effect of the operator γ and
enable us, in particular, to compute by algebraic means the
eigenvalues and the eigenrays of γ. If u is an eigenvector of γ, so
that γu = v = λu, then $v_1^P = \lambda u_1^P$ and $v_2^P = \lambda u_2^P$, and hence,
if u is an eigenvector of γ, Equations (4) go over into

$$\begin{aligned} \lambda u_1^P &= 0 \cdot u_1^P + 1 \cdot u_2^P \\ \lambda u_2^P &= 1 \cdot u_1^P + 0 \cdot u_2^P, \end{aligned} \tag{5}$$

that is, into

$$\begin{aligned} -\lambda u_1^P + u_2^P &= 0 \\ u_1^P - \lambda u_2^P &= 0. \end{aligned} \tag{6}$$

Equations (6), which contain three unknowns, λ, $u_1^P$, and $u_2^P$,
enable us to evaluate just the quantities we want, namely, λ and
$u_2^P/u_1^P$, that is, the eigenvalues of γ and the slopes of the eigenrays
of γ relative to the frame P. Indeed, in order that these simultaneous
linear homogeneous equations have solutions other than
the trivial $u_1^P = u_2^P = 0$, it is necessary and sufficient that the
determinant of the coefficients be zero, that is, that

$$\begin{vmatrix} -\lambda & 1 \\ 1 & -\lambda \end{vmatrix} = 0, \tag{7}$$

or λ² − 1 = 0, and

$$\lambda = \pm 1. \tag{8}$$

The eigenvalues of our γ are thus 1 and −1, a result obtained by
purely geometrical considerations in the preceding section.

To calculate the eigenvectors, or rather the eigenrays, of our γ
we substitute the permissible values of λ, one by one, into (6),
and find that

$$\text{for } \lambda = 1, \qquad u_2^P/u_1^P = 1 \tag{9}$$

$$\text{for } \lambda = -1, \qquad u_2^P/u_1^P = -1. \tag{10}$$

Equation (9) states that, relative to the frame P, the slope of the
eigenray of γ belonging to the eigenvalue 1 is 1; Equation (10)
states that, relative to the frame P, the slope of the eigenray of γ
belonging to the eigenvalue −1 is −1. These results check with
those of the preceding section, where we found the eigenrays of γ
belonging to the eigenvalues 1 and −1 to be the rays A and B,
respectively.
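The passage from Equations (6) to (8), (9), and (10) can be mirrored in a few lines of code. The following is a minimal sketch, assuming the coefficients of a mapping are stored as a pair of rows in the frame P; the function names are illustrative and are not part of the text. The eigenvalues are found from the vanishing of the determinant, which for a two-rowed array is a quadratic equation in λ, and the slope follows from the first of the two homogeneous equations.

```python
import cmath


def eigenvalues_2x2(m):
    """Roots of the determinantal equation (a - L)(d - L) - b*c = 0."""
    (a, b), (c, d) = m
    trace = a + d
    det = a * d - b * c
    root = cmath.sqrt(trace * trace - 4 * det)
    return ((trace + root) / 2, (trace - root) / 2)


def eigenray_slope(m, lam):
    """Slope u2/u1 from the first equation (a - L)*u1 + b*u2 = 0; needs b != 0."""
    (a, b), _ = m
    return (lam - a) / b


# Coefficients of Equations (4) for the reflection in the line A.
GAMMA = ((0, 1), (1, 0))

values = eigenvalues_2x2(GAMMA)
slopes = tuple(eigenray_slope(GAMMA, lam) for lam in values)
```

For the coefficients of (4) this gives the eigenvalues 1 and −1 with slopes 1 and −1, respectively. The square root is taken with `cmath` so that the same sketch also covers cases in which the roots are not real.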

In Equations (4), which specify analytically the effect of our
mapping γ, the components $u_1^P$ and $u_2^P$ are variables, and hence
the structure of (4) can be adequately described by listing in
some readily decipherable manner the fixed numerical coefficients
on the right of (4); the standard method of doing this is to arrange
the coefficients in exactly the same positions as they occur in (4),
and to let the array, or matrix,

$$\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \tag{11}$$

summarize the essential content of Equations (4). This matrix
enables us, by means of Equations (4), to calculate the position
into which γ sends any given vector; we call it 'the matrix
representing the operator γ in the frame P,' denote it by $\gamma^P$, and
write

$$\gamma^P = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}. \tag{12}$$

We shall always mark matrices having a finite number of rows
and columns by large parentheses, as in (12), and determinants
by vertical lines, as in (7).

In the case of the other operators of the preceding section, the
passage from geometric to analytic methods can be carried out
in an entirely analogous way. For example, if u is an arbitrary
vector and v is the vector ξu, that is, the vector into which u is
sent by a counterclockwise 90° rotation about 0, we find, as seen
from Fig. 89, that

$$\begin{aligned} v_1^P &= 0 \cdot u_1^P - 1 \cdot u_2^P \\ v_2^P &= 1 \cdot u_1^P + 0 \cdot u_2^P \end{aligned} \tag{13}$$

and that consequently the matrix representing the operator ξ
in the frame P is

$$\xi^P = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}. \tag{14}$$

Fig. 89.

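Similar calculations show that the eight operators of Fig. 85 are
represented in the frame P by the following matrices:

$$\begin{aligned}
1^P &= \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, &
\gamma^P &= \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, &
\delta^P &= \begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix}, &
\epsilon^P &= \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \\
\zeta^P &= \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}, &
\eta^P &= \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}, &
\xi^P &= \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}, &
(-1)^P &= \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}.
\end{aligned} \tag{15}$$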
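The eight arrays of coefficients can be put to work at once. The sketch below is an illustration only, written under two assumptions: that the frame P has its axes along the lines C and D, as stated at the beginning of this section, and that two arrays are combined by the usual row-into-column rule of algebra. The dictionary keys and function names are hypothetical labels chosen for the example.

```python
# The eight mappings of Fig. 85 as arrays of coefficients in the frame P.
OPS = {
    "1": ((1, 0), (0, 1)),
    "gamma": ((0, 1), (1, 0)),      # reflection in line A
    "delta": ((0, -1), (-1, 0)),    # reflection in line B
    "epsilon": ((1, 0), (0, -1)),   # reflection in line C
    "zeta": ((-1, 0), (0, 1)),      # reflection in line D
    "eta": ((0, 1), (-1, 0)),       # clockwise 90-degree rotation
    "xi": ((0, -1), (1, 0)),        # counterclockwise 90-degree rotation
    "-1": ((-1, 0), (0, -1)),       # inversion in 0
}


def matmul(a, b):
    """Product of two 2x2 arrays: rows of a into columns of b."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


NAME_OF = {matrix: name for name, matrix in OPS.items()}


def product(x, y):
    """Name of the operator xy (y acts first, then x)."""
    return NAME_OF[matmul(OPS[x], OPS[y])]


# Every pairwise product is again one of the eight operators.
TABLE = {(x, y): product(x, y) for x in OPS for y in OPS}
```

With these arrays, `product("eta", "epsilon")` returns `"delta"` and `product("epsilon", "eta")` returns `"gamma"`, in agreement with the geometric results (10) and (12) of §53.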


In general, if the components, relative to a frame P, of every
vector u and of the vector v = αu, into which a mapping α sends
u, are related by the equations

$$\begin{aligned} v_1^P &= a u_1^P + b u_2^P \\ v_2^P &= c u_1^P + d u_2^P, \end{aligned} \tag{16}$$

where a, b, c, and d are numerical constants, we say that the
matrix

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} \tag{17}$$

represents the mapping α in the frame P; conversely, the statement
that a mapping α is represented in the frame P by the
matrix (17) means that the components of an arbitrary vector u
and of the vector v = αu, into which α sends u, are related through
Equations (16). It is, of course, impossible to summarize the
significance of a matrix by quoting any single number. When
the numerical values of the constants a, b, c, and d in (16) and
(17), having to do with a mapping α and the frame P, are not
given explicitly, we usually denote them by $\alpha_{11}^P$, $\alpha_{12}^P$, $\alpha_{21}^P$, and
$\alpha_{22}^P$; (16) and (17) then have the forms

$$\begin{aligned} v_1^P &= \alpha_{11}^P u_1^P + \alpha_{12}^P u_2^P \\ v_2^P &= \alpha_{21}^P u_1^P + \alpha_{22}^P u_2^P \end{aligned} \qquad \begin{pmatrix} \alpha_{11}^P & \alpha_{12}^P \\ \alpha_{21}^P & \alpha_{22}^P \end{pmatrix}. \tag{18a, b}$$

Whenever it will be agreed to keep the coordinate frame in mind,
the superscripts in (18) will be omitted; a mapping and the matrix
representing it will then be denoted by the same symbol.





Vectors with complex components. The application of algebraic
methods to geometric problems suggests a generalization of
terminology. For example, from the purely geometric viewpoint,
the straight line y = 2x and the parabola y = x² + 2 of Fig. 90 do
not intersect. To investigate the question of the intersection of
these curves from the standpoint of analytic geometry, we look for
the simultaneous solutions of the equations

$$y = 2x, \qquad y = x^2 + 2. \tag{19}$$

The solutions

$$x_1 = 1 + i, \qquad y_1 = 2 + 2i \tag{20a}$$

and

$$x_2 = 1 - i, \qquad y_2 = 2 - 2i \tag{20b}$$

are complex, as is to be expected from the fact that the curves do
not intersect from the geometric viewpoint. Now, the solutions
(20) enable us to say, if we wish, that the straight line and the
parabola of Fig. 90 do intersect, provided we include points whose
coordinates are complex numbers.
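The solutions (20) are easily checked by machine. The following minimal sketch eliminates y between the two equations (19), which gives the quadratic x² − 2x + 2 = 0, and solves it with complex arithmetic; the function name is illustrative only.

```python
import cmath


def line_parabola_intersections():
    """Simultaneous solutions of y = 2x and y = x**2 + 2."""
    # Eliminating y gives x**2 - 2x + 2 = 0.
    a, b, c = 1, -2, 2
    root = cmath.sqrt(b * b - 4 * a * c)  # square root of -4
    xs = ((-b + root) / (2 * a), (-b - root) / (2 * a))
    return [(x, 2 * x) for x in xs]


points = line_parabola_intersections()
```

Each returned pair (x, y) satisfies both equations of (19), although neither pair can be marked on the real plane of Fig. 90.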

The notion of points with complex coordinates or, what is the
same thing, of vectors with complex components is a useful one
because it enables us to retain geometric terminology in certain
purely algebraic calculations. Although matters of terminology
may at first sight seem trivial, the retention of geometric terms
in our algebraic work turns out to be profitable, if only because it
helps us to remember (and in some cases perhaps also to visualize)
certain algebraic relations and processes. Accordingly, we shall
henceforth speak freely of vectors having complex components
and shall treat them on the same footing as vectors having real
components. The totality of all vectors of the form (1), that is,

$$u = u_1^P e_1^P + u_2^P e_2^P, \tag{21}$$

where now $u_1^P$ and $u_2^P$ are arbitrary real, imaginary, or complex
numbers, will be called the complex plane, or simply the plane.
The points in the physical plane of the paper in Fig. 87 comprise
only a portion of our complex plane; the complete complex plane
can be visualized no more readily than can the two points of
intersection of the parabola and the straight line in Fig. 90. If u
is a vector in the complex plane, then the locus of the termini
of all vectors of the form cu (where c is now not necessarily real)
comprises a ray; the picture of a ray in the complex plane as a
straight line through 0 is of course an imperfect one.

The reader should not confuse our complex plane with the plane, called
the Argand diagram, used for graphical representation of complex numbers
in the theory of functions of a complex variable, in the treatment of
alternating currents by means of complex numbers, and so forth. In an Argand
diagram the number a + bi is associated with a point whose coordinate
along the 'axis of the reals' is the real number a and whose coordinate
along the 'axis of the imaginaries' is the real number b. Our complex
plane, on the other hand, contains points whose coordinates are complex.


The uniformizing effect of working in the complex rather than
in the real plane can be illustrated further as follows: In discussing
the eight mappings of Fig. 85 from the geometric viewpoint,
we had no difficulty in identifying their eigenvalues and
eigenvectors, save in the case of the rotations η and ξ; eigenvectors
and eigenvalues had in fact to be denied to η and ξ, so that these
mappings appeared to be exceptional. The situation is different
from the analytic standpoint. For example, the analytic significance
of the rotation ξ is given by (13), so that the circumstances
under which ξu = λu may be determined through Equations
(13). Writing v = λu in (13) we get

$$\begin{aligned} \lambda u_1^P &= 0 \cdot u_1^P - 1 \cdot u_2^P \\ \lambda u_2^P &= 1 \cdot u_1^P + 0 \cdot u_2^P, \end{aligned} \qquad \text{that is,} \qquad \begin{aligned} -\lambda u_1^P - u_2^P &= 0 \\ u_1^P - \lambda u_2^P &= 0. \end{aligned} \tag{22a, b}$$

These equations have solutions other than $u_1^P = u_2^P = 0$ if the
determinant of the coefficients vanishes, that is, if λ² + 1 = 0, or

$$\lambda = \pm i. \tag{23}$$

The eigenvalues of our ξ are consequently i and −i; this result
was foreshadowed in Table 11, according to which ξ² = −1.

To compute the eigenrays of ξ we substitute the eigenvalues
(23), one by one, into (22) and find that

$$\text{for } \lambda = i, \qquad u_2^P/u_1^P = -i \tag{24}$$

$$\text{for } \lambda = -i, \qquad u_2^P/u_1^P = i. \tag{25}$$

The vectors having these imaginary 'slopes' relative to the frame
P are thus not removed from their original rays when the plane
undergoes a counterclockwise 90° rotation; instead, they are
multiplied, respectively, by the constants i and −i. In terms of
the complex plane, our ξ has two distinct eigenvalues and, just
as every other nondegenerate mapping in Fig. 85, two eigenrays.
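Equations (24) and (25) can be verified directly with complex arithmetic. The sketch below applies the coefficients of (13) to one sample vector on each of the two eigenrays; the sample vectors, with first component 1, and the function names are choices made only for this illustration.

```python
# Coefficients of Equations (13): counterclockwise 90-degree rotation.
XI = ((0, -1), (1, 0))


def apply(m, u):
    """Components of the vector into which the array m sends u."""
    return (
        m[0][0] * u[0] + m[0][1] * u[1],
        m[1][0] * u[0] + m[1][1] * u[1],
    )


def scale(c, u):
    return (c * u[0], c * u[1])


u_plus = (1, -1j)   # slope -i, Eq. (24)
u_minus = (1, 1j)   # slope +i, Eq. (25)

belongs_to_i = apply(XI, u_plus) == scale(1j, u_plus)
belongs_to_minus_i = apply(XI, u_minus) == scale(-1j, u_minus)
```

Both comparisons hold, so that each of these vectors is merely multiplied by a constant when the rotation is carried out; a vector with real components, such as (1, 0), is on the other hand sent to (0, 1) and leaves its ray.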

When the plane under consideration is complex, the expression
(3) for the square of the length of a vector needs modification,
since (3) may be negative or complex when $u_1^P$ and $u_2^P$ are complex;
we then adopt the expression

$$(u_1^P)^* u_1^P + (u_2^P)^* u_2^P \tag{26}$$

for the square of the length of u, $(u_1^P)^*$ and $(u_2^P)^*$ being the complex
conjugates of the numbers $u_1^P$ and $u_2^P$. Similarly, we take the
condition

$$(u_1^P)^* v_1^P + (u_2^P)^* v_2^P = 0, \tag{27}$$

or the equivalent condition $(v_1^P)^* u_1^P + (v_2^P)^* u_2^P = 0$, as the
condition for orthogonality of u and v. When the components of u
and v are real, (26) and (27) go over into (3) and (2), respectively.
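The need for the modified expressions is seen most quickly from an example. The following minimal sketch compares the expression (3) with the expression (26) for a vector having complex components, and tests a pair of vectors for orthogonality in the sense of (27); the function names and the sample vectors are assumptions made for the illustration.

```python
def naive_square(u):
    """Expression (3): the sum of the squares of the components."""
    return sum(x * x for x in u)


def length_squared(u):
    """Expression (26): each component times its complex conjugate."""
    return sum((complex(x).conjugate() * x).real for x in u)


def inner(u, v):
    """Left side of condition (27)."""
    return sum(complex(a).conjugate() * b for a, b in zip(u, v))


u = (1, 1j)
v = (1, -1j)

naive_u = naive_square(u)      # 1 + i*i vanishes although u is not zero
proper_u = length_squared(u)   # 1 + 1
orthogonal = inner(u, v) == 0
```

For the vector with components 1 and i the expression (3) gives zero, whereas (26) gives 2; for real components, such as (3, 4), the two expressions agree.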

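Consecutive operations. Let the matrices representing two
mappings α and β in some frame be

$$\alpha = \begin{pmatrix} \alpha_{11} & \alpha_{12} \\ \alpha_{21} & \alpha_{22} \end{pmatrix} \quad \text{and} \quad \beta = \begin{pmatrix} \beta_{11} & \beta_{12} \\ \beta_{21} & \beta_{22} \end{pmatrix}. \tag{28}$$

What is the matrix representing in the same frame the mapping
αβ? Now, if β sends u into v, and α sends v into w, then, by
definition, the mapping αβ will send u directly into w; our task is
consequently that of expressing the components of w in terms
of the components of u, it being given that

$$\begin{aligned} v_1 &= \beta_{11} u_1 + \beta_{12} u_2 \\ v_2 &= \beta_{21} u_1 + \beta_{22} u_2 \end{aligned} \quad \text{and} \quad \begin{aligned} w_1 &= \alpha_{11} v_1 + \alpha_{12} v_2 \\ w_2 &= \alpha_{21} v_1 + \alpha_{22} v_2. \end{aligned} \tag{29a, b}$$

Eliminating $v_1$ and $v_2$ from (29b) by means of (29a), we get

$$\begin{aligned} w_1 &= (\alpha_{11}\beta_{11} + \alpha_{12}\beta_{21}) u_1 + (\alpha_{11}\beta_{12} + \alpha_{12}\beta_{22}) u_2 \\ w_2 &= (\alpha_{21}\beta_{11} + \alpha_{22}\beta_{21}) u_1 + (\alpha_{21}\beta_{12} + \alpha_{22}\beta_{22}) u_2 \end{aligned} \tag{30}$$

for an arbitrary u, and it follows that the desired matrix, which is
denoted by αβ, is

$$\alpha\beta = \begin{pmatrix} \alpha_{11}\beta_{11} + \alpha_{12}\beta_{21} & \alpha_{11}\beta_{12} + \alpha_{12}\beta_{22} \\ \alpha_{21}\beta_{11} + \alpha_{22}\beta_{21} & \alpha_{21}\beta_{12} + \alpha_{22}\beta_{22} \end{pmatrix}. \tag{31}$$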



The reader may recognize the matrix (31) as what is called in
algebra the product of the matrices α and β given by (28); our
result can thus be stated in the following readily remembered
way: the matrix representing the product αβ of the operators
α and β is the product αβ of the matrices representing, respectively,
the operators α and β.

In algebra, the matrix product
$\begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}\begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{pmatrix}$
of the matrices
$\begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}$ and $\begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{pmatrix}$
is defined to be the matrix written
on the right side of the following matrix equation:

$$\begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}\begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{pmatrix} = \begin{pmatrix} a_{11}b_{11} + a_{12}b_{21} & a_{11}b_{12} + a_{12}b_{22} \\ a_{21}b_{11} + a_{22}b_{21} & a_{21}b_{12} + a_{22}b_{22} \end{pmatrix}. \tag{32}$$

The reader should carefully memorize the arrangement of the
entries in (32), noting, in particular, that the entry in the nth
row and mth column of the product is constructed from the entries
in the nth row of the left factor and the mth column of the right
factor; the following diagram illustrates the computation of the
entry in the first row and the second column of the product:

$$\begin{pmatrix} a_{11} & a_{12} \\ \cdot & \cdot \end{pmatrix}\begin{pmatrix} \cdot & b_{12} \\ \cdot & b_{22} \end{pmatrix} = \begin{pmatrix} \cdot & a_{11}b_{12} + a_{12}b_{22} \\ \cdot & \cdot \end{pmatrix}. \tag{33}$$

The multiplication of matrices having more than two rows and
two columns will be discussed presently.


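To illustrate the implications of our result (31) let us take from (15)
the matrices $\eta^P = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$ and $\epsilon^P = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$ and use matrix
multiplication to compute the matrix representing in the frame P the mapping
ηε. We have

$$\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} = \begin{pmatrix} 0 \cdot 1 + 1 \cdot 0 & 0 \cdot 0 + 1 \cdot (-1) \\ (-1) \cdot 1 + 0 \cdot 0 & (-1) \cdot 0 + 0 \cdot (-1) \end{pmatrix} = \begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix}, \tag{34}$$

that is, in the light of (15),

$$\eta^P \epsilon^P = \delta^P. \tag{35}$$

Equation (35) is of course to be expected in view of $10^{53}$.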


To compute the matrix representing the mapping βα when the
matrices representing the mappings α and β are given to be (28),
we consider the vectors αu = v and βv = w, where u is arbitrary,
and express the components of w in terms of those of u. The
desired matrix, denoted by βα, turns out to be

$$\beta\alpha = \begin{pmatrix} \beta_{11}\alpha_{11} + \beta_{12}\alpha_{21} & \beta_{11}\alpha_{12} + \beta_{12}\alpha_{22} \\ \beta_{21}\alpha_{11} + \beta_{22}\alpha_{21} & \beta_{21}\alpha_{12} + \beta_{22}\alpha_{22} \end{pmatrix}; \tag{36}$$

this result could have been obtained directly from (31) by
interchanging the symbols α and β, but leaving all the subscripts as
they stand. According to the definition (32), the matrix (36) is
just the matrix product βα of the matrices (28). Comparison
of (36) with (31) shows that the matrices αβ and βα are usually
different from each other, so that matrix multiplication is in
general noncommutative.

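To illustrate: let us take from (15) the matrices $\epsilon^P$ and $\eta^P$ and compute
the matrix representing in the frame P the mapping εη. We have

$$\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} = \begin{pmatrix} 1 \cdot 0 + 0 \cdot (-1) & 1 \cdot 1 + 0 \cdot 0 \\ 0 \cdot 0 + (-1) \cdot (-1) & 0 \cdot 1 + (-1) \cdot 0 \end{pmatrix} = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \tag{37}$$

and comparison with (34) shows that the matrices $\epsilon^P$ and $\eta^P$ do not commute.
Incidentally, according to (15), Equation (37) means that

$$\epsilon^P \eta^P = \gamma^P, \tag{38}$$

as is to be expected from $12^{53}$.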
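The rule (32) and the two illustrations (34) and (37) can be reproduced in a few lines. The following is a minimal sketch, assuming matrices are stored as pairs of rows and vectors as pairs of components; the names are illustrative. It also checks, for one sample vector, that acting with the product matrix is the same as acting first with the right factor and then with the left factor, which is the content of (29) through (31).

```python
def matmul(a, b):
    """Rule (32): entry (n, m) from row n of a and column m of b."""
    return tuple(
        tuple(sum(a[n][k] * b[k][m] for k in range(2)) for m in range(2))
        for n in range(2)
    )


def apply(m, u):
    """Components of the vector into which the matrix m sends u."""
    return (
        m[0][0] * u[0] + m[0][1] * u[1],
        m[1][0] * u[0] + m[1][1] * u[1],
    )


ETA = ((0, 1), (-1, 0))      # clockwise 90-degree rotation
EPSILON = ((1, 0), (0, -1))  # reflection in line C
GAMMA = ((0, 1), (1, 0))     # reflection in line A
DELTA = ((0, -1), (-1, 0))   # reflection in line B

eta_epsilon = matmul(ETA, EPSILON)   # Eq. (34)
epsilon_eta = matmul(EPSILON, ETA)   # Eq. (37)

u = (2, 5)
same_as_consecutive = apply(eta_epsilon, u) == apply(ETA, apply(EPSILON, u))
```

Here `eta_epsilon` equals `DELTA` and `epsilon_eta` equals `GAMMA`, so that the two products differ and the two matrices do not commute.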


Sums of operators. If the matrices representing two mappings
α and β in some frame are (28), what matrix represents in the
same frame the mapping α + β? Now, according to $12^{2}$, α + β
is that mapping which sends any vector u into the sum of the
vectors αu and βu, so that, if (α + β)u = v, then v = αu + βu;
consequently

$$\begin{aligned} v_1 &= \alpha_{11}u_1 + \alpha_{12}u_2 + \beta_{11}u_1 + \beta_{12}u_2 = (\alpha_{11} + \beta_{11})u_1 + (\alpha_{12} + \beta_{12})u_2 \\ v_2 &= \alpha_{21}u_1 + \alpha_{22}u_2 + \beta_{21}u_1 + \beta_{22}u_2 = (\alpha_{21} + \beta_{21})u_1 + (\alpha_{22} + \beta_{22})u_2 \end{aligned} \tag{39}$$

and hence the desired matrix, which we shall denote by α + β, is

$$\alpha + \beta = \begin{pmatrix} \alpha_{11} + \beta_{11} & \alpha_{12} + \beta_{12} \\ \alpha_{21} + \beta_{21} & \alpha_{22} + \beta_{22} \end{pmatrix}. \tag{40}$$





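In algebra the sum of the matrices $\begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}$ and $\begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{pmatrix}$ is
defined as the matrix written on the right side of the following
matrix equation:

$$\begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix} + \begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{pmatrix} = \begin{pmatrix} a_{11} + b_{11} & a_{12} + b_{12} \\ a_{21} + b_{21} & a_{22} + b_{22} \end{pmatrix}; \tag{41}$$

the reader should carefully memorize the rule for matrix addition
expressed by (41). Our result (40) can now be stated thus: the
matrix representing the sum of the operators α and β is the sum
of the matrices representing, respectively, the operators α and β.
For example, according to (15), the matrix representing in the
frame P the sum of our mappings γ and ξ is $\begin{pmatrix} 0 & 0 \\ 2 & 0 \end{pmatrix}$.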
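The rule (41) and the reasoning of (39) can be checked in the same spirit. The sketch below assumes the matrices of the reflection in the line A and of the counterclockwise 90° rotation in the frame P, namely the arrays of coefficients of Equations (4) and (13); the function names are illustrative.

```python
def matadd(a, b):
    """Rule (41): add corresponding entries."""
    return tuple(
        tuple(a[n][m] + b[n][m] for m in range(2)) for n in range(2)
    )


def apply(m, u):
    """Components of the vector into which the matrix m sends u."""
    return (
        m[0][0] * u[0] + m[0][1] * u[1],
        m[1][0] * u[0] + m[1][1] * u[1],
    )


def add(u, v):
    return (u[0] + v[0], u[1] + v[1])


GAMMA = ((0, 1), (1, 0))   # coefficients of Equations (4)
XI = ((0, -1), (1, 0))     # coefficients of Equations (13)

gamma_plus_xi = matadd(GAMMA, XI)

u = (3, 7)
# Definition of the sum of two operators: (gamma + xi)u = gamma u + xi u.
agrees = apply(gamma_plus_xi, u) == add(apply(GAMMA, u), apply(XI, u))
```

The sum matrix has the rows (0, 0) and (2, 0), and acting with it on a vector gives the same result as adding the two vectors obtained by acting with each matrix separately.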

Representation of vectors by columns of numbers. In treating
mappings by analytic methods, we found it convenient to represent
them by square arrays of numbers, that is, by square matrices.
A certain regularity is introduced into our formalism if
we also agree, when listing the components of a vector, to arrange
the components into a column. Thus, when working in some
coordinate frame that is to be kept in mind, we shall write

$$u = \begin{pmatrix} u_1 \\ u_2 \end{pmatrix} \tag{42}$$

instead of saying that the components of a vector u are $u_1$ and
$u_2$. For example, we shall write

$$v = \begin{pmatrix} 1 \\ 3 \end{pmatrix} \tag{43}$$

instead of saying that the components of the particular vector v
with respect to the given frame are $v_1 = 1$ and $v_2 = 3$. Column
symbols such as (42) and (43) are called representatives of our
vectors; but for brevity we shall usually refer to column symbols
simply as vectors.

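We shall next set up rules for the manipulation of the column
symbols. First of all, whenever u = v, we agree to write

$$\begin{pmatrix} u_1 \\ u_2 \end{pmatrix} = \begin{pmatrix} v_1 \\ v_2 \end{pmatrix}, \tag{44}$$

so that (44) will imply the two equations

$$u_1 = v_1, \qquad u_2 = v_2, \tag{45}$$

and conversely. Substituting the column symbols for u, v, and w
into the equation w = u + v, we get

$$\begin{pmatrix} w_1 \\ w_2 \end{pmatrix} = \begin{pmatrix} u_1 \\ u_2 \end{pmatrix} + \begin{pmatrix} v_1 \\ v_2 \end{pmatrix}; \tag{46}$$

now, if w = u + v, then $w_1 = u_1 + v_1$ and $w_2 = u_2 + v_2$; consequently,
in order that (46) may imply the equation w = u + v,
we must adopt the following rule for addition of our column
symbols:

$$\begin{pmatrix} u_1 \\ u_2 \end{pmatrix} + \begin{pmatrix} v_1 \\ v_2 \end{pmatrix} = \begin{pmatrix} u_1 + v_1 \\ u_2 + v_2 \end{pmatrix}. \tag{47}$$

Similarly, if v = cu, where c is a constant, a direct substitution of
our new symbols yields

$$\begin{pmatrix} v_1 \\ v_2 \end{pmatrix} = c \begin{pmatrix} u_1 \\ u_2 \end{pmatrix}; \tag{48}$$

now, if v = cu, then $v_1 = cu_1$ and $v_2 = cu_2$; consequently, in order
that (48) may imply the equation v = cu, we must adopt the
following rule for multiplying a column symbol by a constant:

$$c \begin{pmatrix} u_1 \\ u_2 \end{pmatrix} = \begin{pmatrix} cu_1 \\ cu_2 \end{pmatrix}. \tag{49}$$

Finally, if $\alpha u = v$, the direct substitution of our symbols gives

$$\begin{pmatrix} \alpha_{11} & \alpha_{12} \\ \alpha_{21} & \alpha_{22} \end{pmatrix} \begin{pmatrix} u_1 \\ u_2 \end{pmatrix} = \begin{pmatrix} v_1 \\ v_2 \end{pmatrix}. \tag{50}$$

But the equation $\alpha u = v$ means that $v_1$ and $v_2$ are related to $u_1$
and $u_2$ through Equations (18); hence, in order that (50) may be
equivalent to $\alpha u = v$, we must adopt the rule

$$\begin{pmatrix} \alpha_{11} & \alpha_{12} \\ \alpha_{21} & \alpha_{22} \end{pmatrix} \begin{pmatrix} u_1 \\ u_2 \end{pmatrix} = \begin{pmatrix} \alpha_{11}u_1 + \alpha_{12}u_2 \\ \alpha_{21}u_1 + \alpha_{22}u_2 \end{pmatrix}. \tag{51}$$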

The rule (51) for the multiplication of a vector by a matrix
resembles the rule (32) for the multiplication of a matrix by a
matrix; the following diagram illustrates the computation of the
entry in the second row of the right side of (51):

$$\begin{pmatrix} \cdot & \cdot \\ \alpha_{21} & \alpha_{22} \end{pmatrix} \begin{pmatrix} u_1 \\ u_2 \end{pmatrix} = \begin{pmatrix} \cdot \\ \alpha_{21}u_1 + \alpha_{22}u_2 \end{pmatrix} \tag{52}$$

According to (9), any eigenvector u of our mapping $\gamma$, belonging
to the eigenvalue 1 of $\gamma$, satisfies the equation $u_2/u_1 = 1$, so that
the vector

$$u = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \tag{53}$$

in the frame P, may be taken as a sample eigenvector of $\gamma$ belonging
to the eigenvalue 1. Multiplying (53) by $\gamma^P$ from (15), we get

$$\gamma^P \begin{pmatrix} 1 \\ 1 \end{pmatrix} = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \tag{54}$$

that is,

$$\gamma^P \begin{pmatrix} 1 \\ 1 \end{pmatrix} = 1 \cdot \begin{pmatrix} 1 \\ 1 \end{pmatrix}. \tag{55}$$

Equation (55) means that operating on the vector (53) with the
matrix $\gamma^P$ is equivalent to multiplying (53) by the constant 1;
Equation (55) thus restates that the vector (53), when the
column symbol is interpreted in terms of the frame P, is an eigenvector
of $\gamma$ belonging to the eigenvalue 1. Similarly, according
to (10), a sample eigenvector of $\gamma$ belonging to the eigenvalue −1,
when referred to the frame P, is $u = \begin{pmatrix} 1 \\ -1 \end{pmatrix}$; multiplying this
vector by $\gamma^P$, we indeed get $-u$.

The rules for the analytic treatment of mappings of a plane 
presented above will be presently extended to mapping of spaces 
of higher dimensionality. 
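
The rules (41), (47), (49), and (51) can be tried out numerically. The following is a minimal sketch, assuming that a 2-by-2 matrix is stored as a nested list and a column symbol as a two-element list; the function names are hypothetical and are not part of the text.

```python
# Illustrative helpers for the 2-by-2 rules of this section.
# All names here are assumptions made for this sketch.

def mat_add(a, b):
    """Rule (41): add two 2-by-2 matrices element by element."""
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]

def vec_add(u, v):
    """Rule (47): add two column symbols component by component."""
    return [u[i] + v[i] for i in range(2)]

def scale_vec(c, u):
    """Rule (49): multiply a column symbol by a constant c."""
    return [c * x for x in u]

def mat_vec(a, u):
    """Rule (51): multiply a column symbol from the left by a matrix."""
    return [a[i][0] * u[0] + a[i][1] * u[1] for i in range(2)]

a = [[1, 2], [3, 4]]
b = [[5, 6], [7, 8]]
v = [1, 3]                 # the column symbol (43)

print(mat_add(a, b))       # [[6, 8], [10, 12]]
print(vec_add(v, v))       # [2, 6]
print(scale_vec(2, v))     # [2, 6]
print(mat_vec(a, v))       # [7, 15]
```

Note that `mat_vec` computes each entry exactly as the diagram (52) suggests: a row of the matrix is paired with the column symbol.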

Exercises 

1. Show that a mapping consisting in multiplying every vector of a
plane by a constant c is represented by the matrix $\begin{pmatrix} c & 0 \\ 0 & c \end{pmatrix}$, and justify
the following rule for the multiplication of a matrix by a constant:

$$c \begin{pmatrix} \alpha_{11} & \alpha_{12} \\ \alpha_{21} & \alpha_{22} \end{pmatrix} = \begin{pmatrix} c\alpha_{11} & c\alpha_{12} \\ c\alpha_{21} & c\alpha_{22} \end{pmatrix}.$$

—.(-3 :)-c D'Ca -X :)- 

/ 23 17 \ /7 sy 1 2 \ _ /4 3\ / 1 2 \/5\ 

\-63 .-39/’ Vs e/V-S -4/ ~ 4/’ \-3 -i/W 

ahOOQ- 

3. Verify that the multiplication table of the matrices (15) is Table 11.
4. Use the matrices (15) and matrix algebra to verify the relations
$(1 + \gamma)(1 - \gamma) = 0$, $(1 + \gamma)^2 = 2(1 + \gamma)$, $\epsilon\eta - \eta\epsilon = 2\gamma$, and $\epsilon\eta + \eta\epsilon = 0$
derived geometrically in Exercise 3®®. 

5. Working in the frame P and using the matrices (15), show that, if


' ■ O' " 


then yv 


and 


= O' - ■ (O' •• X-0 (O' 

~ ( 1 ) ^ where the mappings are those of Fig. 85. 


6. Use the matrices (15) to investigate analytically the eigenvalues
and eigenvectors of the mappings of Fig. 85, and verify Table 12.

[Table 12]

7. Show that the two eigenrays of every one of the mappings of
Table 12 are orthogonal. Show that the length of each of the
particular eigenvectors of $\gamma$, $\delta$, $\eta$, and $\zeta$ listed in this table is $\sqrt{2}$.

8. A certain mapping $\sigma$ of the plane is represented in the frame P by
the matrix $\begin{pmatrix} 1.04 & .72 \\ .72 & 1.46 \end{pmatrix}$. Show analytically that the eigenvalues of $\sigma$
are .5 and 2.0, and that sample eigenvectors of $\sigma$, belonging,
respectively, to these eigenvalues, can be represented in the
frame P by $\begin{pmatrix} 4 \\ -3 \end{pmatrix}$ and $\begin{pmatrix} 3 \\ 4 \end{pmatrix}$. Note the orthogonality of the
eigenrays of $\sigma$. Draw graphs of a unit circle and of the ellipse into
which $\sigma$ sends this circle; note that the axes of the ellipse lie on the
eigenrays of $\sigma$.



9. A certain mapping $\rho$ of the plane is represented in the frame P by
the matrix $\begin{pmatrix} 2.5 & 1.0 \\ 2.0 & 1.5 \end{pmatrix}$. Show analytically that the eigenvalues of $\rho$ are
.5 and 3.5, and that the corresponding eigenrays of $\rho$ are not orthogonal.
Draw graphs of a unit circle and of the ellipse into which $\rho$ sends this
circle; note that the axes of the ellipse do not lie on the eigenrays of $\rho$.


55. Different Coordinate Frames May Yield Different Representatives
for a Given Operator

The matrices representing the eight mappings of Fig. 85 were
computed in §54 with reference to the particular frame P of
Fig. 87; to illustrate that the representatives of mappings, as well
as of vectors, may be different in different coordinate frames, we
shall now make some computations using the frame Q, shown
in Fig. 91, whose axes $Q_1$ and $Q_2$ are supposed to lie along the
lines A and B of Fig. 86; the basic unit vectors of this frame will
be denoted by $e_1^Q$ and $e_2^Q$. Any vector u in the plane can be
resolved in terms of $e_1^Q$ and $e_2^Q$, the numbers $u_1^Q$ and $u_2^Q$ in

$$u = u_1^Q e_1^Q + u_2^Q e_2^Q \tag{1}$$

being the components of u with respect to the frame Q.

[Fig. 91. The frame Q.] [Fig. 92.]

Let us consider first the relations between the components $u_1^Q$
and $u_2^Q$ of an arbitrary vector u, and the components $v_1^Q$ and $v_2^Q$
of the vector $v = \gamma u$, that is, the vector into which our mapping
$\gamma$ sends the vector u, $\gamma$ being the reflection of the plane in the
line A. As seen from Fig. 92, $v_1^Q = u_1^Q$ and $v_2^Q = -u_2^Q$, that is,

$$\begin{cases} v_1^Q = 1 \cdot u_1^Q + 0 \cdot u_2^Q \\ v_2^Q = 0 \cdot u_1^Q - 1 \cdot u_2^Q. \end{cases} \tag{2}$$


Consequently the matrix that represents $\gamma$ in the frame Q and
that we denote by $\gamma^Q$ is

$$\gamma^Q = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{3}$$

This matrix is different from the matrix $\gamma^P$ representing $\gamma$ in the
frame P; it is, as we might expect, the same as 
Next, let us calculate the eigenvalues of $\gamma$, using the frame Q.
If u is an eigenvector of $\gamma$, so that $v = \gamma u = \lambda u$, then $v_1^Q = \lambda u_1^Q$
and $v_2^Q = \lambda u_2^Q$, and (2) becomes

$$\begin{cases} (1 - \lambda)u_1^Q + 0 \cdot u_2^Q = 0 \\ 0 \cdot u_1^Q + (-1 - \lambda)u_2^Q = 0. \end{cases} \tag{4}$$

Equations (4) have solutions other than $u_1^Q = u_2^Q = 0$ if and
only if the determinant of their coefficients vanishes, that is, if
and only if

$$\begin{vmatrix} (1 - \lambda) & 0 \\ 0 & (-1 - \lambda) \end{vmatrix} = 0. \tag{5}$$

The solutions of (5) are $\lambda = \pm 1$, and hence the eigenvalues of
our $\gamma$ are 1 and −1, in agreement with $8^{54}$ and with the
geometric conclusions of §53.

The reader will have made a long step toward understanding 
the discussion which is to follow if he remembers that the eigen- 
values of a mapping are intrinsic attributes of the mapping itself, 
and that, consequently, whatever procedure may be adopted in 
an analytic computation (for example, whatever coordinate frame 
may be used in representing the mapping by a matrix), the 
results of the computation of the eigenvalues of a given mapping 
should be quite the same. 
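
The frame independence of the eigenvalues can be checked numerically. The sketch below computes the eigenvalues of a real 2-by-2 matrix from its secular equation, written as $\lambda^2 - (\text{trace})\lambda + \text{determinant} = 0$, and applies it to the representative (3) of $\gamma$ in the frame Q. The matrix used for the frame P is an assumption made for this illustration (the reflection whose eigenvalue-1 eigenray has slope 1 in P); the matrices (15) themselves are not reproduced here, and the function name is hypothetical.

```python
import math

def eigenvalues_2x2(m):
    """Roots of the secular equation of a real 2-by-2 matrix.

    Assumes the roots are real (true for the reflection used below).
    """
    tr = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    root = math.sqrt(tr * tr - 4 * det)
    return sorted([(tr - root) / 2, (tr + root) / 2])

gamma_Q = [[1, 0], [0, -1]]   # Equation (3): gamma in the frame Q
gamma_P = [[0, 1], [1, 0]]    # ASSUMED form of gamma in the frame P

print(eigenvalues_2x2(gamma_Q))   # [-1.0, 1.0]
print(eigenvalues_2x2(gamma_P))   # [-1.0, 1.0]
```

The two matrices differ, yet the computed eigenvalues agree, as the paragraph above asserts they must.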

Finally, let us compute the eigenrays of $\gamma$, using the frame Q.
Setting $\lambda = 1$ in (4), we find that the slope $u_2^Q/u_1^Q$, relative to the
frame Q, of the eigenray of $\gamma$ belonging to the eigenvalue 1 is 0;
this eigenray consequently coincides with the $Q_1$-axis and is the
ray A, first identified in §53. Treating the case $\lambda = -1$ in a
similar way, we find that the following column symbols represent,
in the frame Q, sample eigenvectors of $\gamma$:

$$\begin{pmatrix} 1 \\ 0 \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} 0 \\ 1 \end{pmatrix}. \tag{6}$$

The column symbols referring to the frame Q and representing
the particular sample eigenvectors of $\gamma$ that are listed in Table 12
are obtained by multiplying the column symbols given in (6)
by $\sqrt{2}$; the resulting symbols are, as we might expect, different
from the corresponding symbols listed in Table 12 and referring
to the frame P.

The eigenvectors of a mapping are intrinsic attributes of the
mapping itself, and hence, whatever analytic method may be
adopted in computing the eigenvectors of a given mapping, the
results should be the same; but the column symbol representing
an eigenvector is usually different for different coordinate frames.


Exercises 

1. Show that the following matrices represent the mappings of Fig. 85
in the frame Q, and compare them with $15^{54}$:

(7)

2. Verify that the multiplication table of the matrices (7) is
Table 11.

3. Work out Exercise using (7) rather than $15^{54}$.

4. Use the matrices (7) to investigate analytically the eigenvalues
and eigenvectors of the mappings of Fig. 85; verify Table 13 and
compare it with Table 12.

[Table 13]

5. Show that a counterclockwise rotation of a real plane
through the angle $\phi$ about O is represented in every (real)
coordinate frame by the matrix

$$\begin{pmatrix} \cos\phi & -\sin\phi \\ \sin\phi & \cos\phi \end{pmatrix}.$$

56. Matrices and Vectors


In §54, while systematizing the algebraic procedure for the
study of mappings of a plane, we introduced matrices such as
$\begin{pmatrix} \alpha_{11} & \alpha_{12} \\ \alpha_{21} & \alpha_{22} \end{pmatrix}$, $\begin{pmatrix} \beta_{11} & \beta_{12} \\ \beta_{21} & \beta_{22} \end{pmatrix}$, column symbols, or vectors,
such as $\begin{pmatrix} u_1 \\ u_2 \end{pmatrix}$, and so on, and evolved certain rules for combining
such arrays with one another. We shall now state the
rudiments of the algebra of arrays of a more general kind. Our
definitions and rules will be direct generalizations of those of §54,
and the numbers at the left of some of the formulas below will
refer to the corresponding formulas of §54. In the next section
we shall show how a helpful quasi-geometric significance can be
read into the algebraic steps of this section.

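Arrays of numbers such as

$$\alpha = \begin{pmatrix} \alpha_{11} & \alpha_{12} & \cdots & \alpha_{1N} \\ \alpha_{21} & \alpha_{22} & \cdots & \alpha_{2N} \\ \vdots & \vdots & & \vdots \\ \alpha_{N1} & \alpha_{N2} & \cdots & \alpha_{NN} \end{pmatrix}, \qquad \beta = \begin{pmatrix} \beta_{11} & \beta_{12} & \cdots & \beta_{1N} \\ \beta_{21} & \beta_{22} & \cdots & \beta_{2N} \\ \vdots & \vdots & & \vdots \\ \beta_{N1} & \beta_{N2} & \cdots & \beta_{NN} \end{pmatrix}, \tag{1}$$

and so forth, are called square matrices of N rows and N columns,
N-by-N matrices, or simply matrices, and are usually denoted by
single letters, as in (1). The individual numerical entries comprising
a matrix $\alpha$ are called elements or components of $\alpha$; the element
$\alpha_{nm}$ of $\alpha$, that is, the element in the nth row and mth column
of $\alpha$, is called the nmth element of $\alpha$; the range of each of the
subscripts occurring in this section will be 1, 2, ..., N, and all
matrices considered below will have the same N.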

If (and only if) all of the corresponding elements of two matrices
$\alpha$ and $\beta$ are equal, that is, if

$$\alpha_{nm} = \beta_{nm} \tag{2}$$

for all values of n and m, we write

$$\alpha = \beta \tag{3}$$

and say that the matrices $\alpha$ and $\beta$ are equal to each other. The
symbolic equation (3) is essentially an abbreviation for the $N^2$
equations (2).


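The rule for adding two matrices is:

$$\begin{pmatrix} \alpha_{11} & \alpha_{12} & \cdots & \alpha_{1N} \\ \alpha_{21} & \alpha_{22} & \cdots & \alpha_{2N} \\ \vdots & \vdots & & \vdots \\ \alpha_{N1} & \alpha_{N2} & \cdots & \alpha_{NN} \end{pmatrix} + \begin{pmatrix} \beta_{11} & \beta_{12} & \cdots & \beta_{1N} \\ \beta_{21} & \beta_{22} & \cdots & \beta_{2N} \\ \vdots & \vdots & & \vdots \\ \beta_{N1} & \beta_{N2} & \cdots & \beta_{NN} \end{pmatrix} = \begin{pmatrix} \alpha_{11} + \beta_{11} & \alpha_{12} + \beta_{12} & \cdots & \alpha_{1N} + \beta_{1N} \\ \alpha_{21} + \beta_{21} & \alpha_{22} + \beta_{22} & \cdots & \alpha_{2N} + \beta_{2N} \\ \vdots & \vdots & & \vdots \\ \alpha_{N1} + \beta_{N1} & \alpha_{N2} + \beta_{N2} & \cdots & \alpha_{NN} + \beta_{NN} \end{pmatrix}. \tag{4}$$

The matrix on the right side of (4) is called the sum of the matrices
$\alpha$ and $\beta$, and is denoted by $\alpha + \beta$; thus we have the $N^2$
relations

$$(\alpha + \beta)_{nm} = \alpha_{nm} + \beta_{nm}, \tag{5}$$

where $(\alpha + \beta)_{nm}$ is the nmth element of the matrix $\alpha + \beta$.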

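The rule for multiplying a matrix $\alpha$ by a constant c is:

$$c \begin{pmatrix} \alpha_{11} & \alpha_{12} & \cdots & \alpha_{1N} \\ \alpha_{21} & \alpha_{22} & \cdots & \alpha_{2N} \\ \vdots & \vdots & & \vdots \\ \alpha_{N1} & \alpha_{N2} & \cdots & \alpha_{NN} \end{pmatrix} = \begin{pmatrix} c\alpha_{11} & c\alpha_{12} & \cdots & c\alpha_{1N} \\ c\alpha_{21} & c\alpha_{22} & \cdots & c\alpha_{2N} \\ \vdots & \vdots & & \vdots \\ c\alpha_{N1} & c\alpha_{N2} & \cdots & c\alpha_{NN} \end{pmatrix}. \tag{6}$$

The matrix on the right side of (6) is called the product of the
matrix $\alpha$ and the constant c, and is denoted by $c\alpha$; thus we have
the $N^2$ relations

$$(c\alpha)_{nm} = c\alpha_{nm}, \tag{7}$$

where $(c\alpha)_{nm}$ is the nmth element of the matrix $c\alpha$.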

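The rule for multiplying a matrix $\beta$ from the left by a matrix $\alpha$ is:

$$\begin{pmatrix} \alpha_{11} & \alpha_{12} & \cdots & \alpha_{1N} \\ \alpha_{21} & \alpha_{22} & \cdots & \alpha_{2N} \\ \vdots & \vdots & & \vdots \\ \alpha_{N1} & \alpha_{N2} & \cdots & \alpha_{NN} \end{pmatrix} \begin{pmatrix} \beta_{11} & \beta_{12} & \cdots & \beta_{1N} \\ \beta_{21} & \beta_{22} & \cdots & \beta_{2N} \\ \vdots & \vdots & & \vdots \\ \beta_{N1} & \beta_{N2} & \cdots & \beta_{NN} \end{pmatrix} = \begin{pmatrix} \sum_i \alpha_{1i}\beta_{i1} & \sum_i \alpha_{1i}\beta_{i2} & \cdots & \sum_i \alpha_{1i}\beta_{iN} \\ \sum_i \alpha_{2i}\beta_{i1} & \sum_i \alpha_{2i}\beta_{i2} & \cdots & \sum_i \alpha_{2i}\beta_{iN} \\ \vdots & \vdots & & \vdots \\ \sum_i \alpha_{Ni}\beta_{i1} & \sum_i \alpha_{Ni}\beta_{i2} & \cdots & \sum_i \alpha_{Ni}\beta_{iN} \end{pmatrix}, \tag{8}$$

where the summation index i runs through the values 1, 2, ..., N.
The matrix on the right side of (8) is called the product of the
matrices $\alpha$ and $\beta$, taken in that order, and is denoted by $\alpha\beta$; thus
we have the $N^2$ relations

$$(\alpha\beta)_{nm} = \sum_i \alpha_{ni}\beta_{im}, \qquad i = 1, 2, \cdots, N, \tag{9}$$

where $(\alpha\beta)_{nm}$ is the nmth element of the matrix $\alpha\beta$.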
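Interchanging $\alpha$ and $\beta$ in (8), we get

$$\begin{pmatrix} \beta_{11} & \beta_{12} & \cdots & \beta_{1N} \\ \beta_{21} & \beta_{22} & \cdots & \beta_{2N} \\ \vdots & \vdots & & \vdots \\ \beta_{N1} & \beta_{N2} & \cdots & \beta_{NN} \end{pmatrix} \begin{pmatrix} \alpha_{11} & \alpha_{12} & \cdots & \alpha_{1N} \\ \alpha_{21} & \alpha_{22} & \cdots & \alpha_{2N} \\ \vdots & \vdots & & \vdots \\ \alpha_{N1} & \alpha_{N2} & \cdots & \alpha_{NN} \end{pmatrix} = \begin{pmatrix} \sum_i \beta_{1i}\alpha_{i1} & \sum_i \beta_{1i}\alpha_{i2} & \cdots & \sum_i \beta_{1i}\alpha_{iN} \\ \sum_i \beta_{2i}\alpha_{i1} & \sum_i \beta_{2i}\alpha_{i2} & \cdots & \sum_i \beta_{2i}\alpha_{iN} \\ \vdots & \vdots & & \vdots \\ \sum_i \beta_{Ni}\alpha_{i1} & \sum_i \beta_{Ni}\alpha_{i2} & \cdots & \sum_i \beta_{Ni}\alpha_{iN} \end{pmatrix}. \tag{10}$$

The matrix on the right side of (10) is called the product of the
matrices $\beta$ and $\alpha$, taken in that order, and is denoted by $\beta\alpha$; thus we have

$$(\beta\alpha)_{nm} = \sum_i \beta_{ni}\alpha_{im}, \qquad i = 1, 2, \cdots, N. \tag{11}$$

Comparison of (8) and (10) shows that the matrices $\alpha\beta$ and $\beta\alpha$
are in general not equal to each other, so that matrix multiplication
is usually noncommutative.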
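
The relations (9) and (11) translate directly into a short program. The sketch below, with a hypothetical function name `mat_mul`, forms both products for the two 4-by-4 matrices that appear in Exercise 2 at the end of this section and confirms that the two products differ.

```python
def mat_mul(a, b):
    """Relation (9): (ab)_nm is the sum over i of a_ni * b_im."""
    n = len(a)
    return [[sum(a[r][i] * b[i][c] for i in range(n)) for c in range(n)]
            for r in range(n)]

# The matrices of Exercise 2 of this section.
alpha = [[1, -2, 3, -4],
         [4, 3, 2, 1],
         [4, -3, 2, -1],
         [1, 2, -3, 4]]
beta = [[1, 4, -1, -4],
        [2, -3, 2, 3],
        [3, 2, -3, -2],
        [4, -1, 4, 1]]

ab = mat_mul(alpha, beta)
ba = mat_mul(beta, alpha)

print(ab[0])      # first row of alpha*beta: [-10, 20, -30, -20]
print(ba[0])      # first row of beta*alpha: [9, 5, 21, -15]
print(ab == ba)   # False: the two products are different matrices
```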

Single-file arrays of numbers such as

$$u = \begin{pmatrix} u_1 \\ u_2 \\ \vdots \\ u_N \end{pmatrix}, \qquad v = \begin{pmatrix} v_1 \\ v_2 \\ \vdots \\ v_N \end{pmatrix}, \tag{12}$$

and so on, are called column symbols, N-vectors, or simply vectors
(strictly, they are matrices of N rows and one column), and are
usually denoted by single letters, as in (12). The N individual
numerical entries comprising a vector u are called components
of u; the component $u_k$, that is, the entry in the kth row of u,
is called the kth component of u; all vectors considered below
will have the same N, this N being equal to that of the matrices
discussed above. The N-vector whose N components are all
zeros is called the vector zero and is denoted by 0. The number

$$(26^{54}) \qquad \sum_i \bar{u}_i u_i, \qquad i = 1, 2, \cdots, N, \tag{13}$$

where $\bar{u}_i$ is the complex conjugate of $u_i$, is called the square of
the length of the vector u. If

$$(27^{54}) \qquad \sum_i \bar{u}_i v_i = 0, \qquad i = 1, 2, \cdots, N, \tag{14}$$

the vectors u and v are said to be orthogonal.
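
As an illustration of (13) and (14), the following minimal sketch evaluates the two sums for small vectors. The function names are assumptions made for this example, and the sample vectors are chosen only for illustration; the complex conjugate is taken on the first factor, as in (13) and (14).

```python
def inner(u, v):
    """The sum over i of conj(u_i) * v_i, as in (14)."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def length_squared(u):
    """The square of the length of u, as in (13)."""
    return inner(u, u)

u = [1, 2, 2]
v = [2, 1, -2]
w = [1j, 1]        # a vector with a complex component

print(length_squared(u))   # 9
print(inner(u, v))         # 0, so u and v are orthogonal
print(length_squared(w))   # (2+0j): real and positive despite the complex entry
```

Without the complex conjugate in (13), the vector `w` would have "length squared" $i^2 + 1 = 0$, which is why the conjugate is included in the definition.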

If (and only if) all of the corresponding components of two
vectors u and v are equal, that is, if

$$u_k = v_k \tag{15}$$

for all values of k, we write

$$u = v \tag{16}$$

and say that the vectors u and v are equal to each other. The
symbolic equation (16) is essentially an abbreviation for the N
equations (15).

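The rule for adding two vectors is:

$$(47^{54}) \qquad \begin{pmatrix} u_1 \\ u_2 \\ \vdots \\ u_N \end{pmatrix} + \begin{pmatrix} v_1 \\ v_2 \\ \vdots \\ v_N \end{pmatrix} = \begin{pmatrix} u_1 + v_1 \\ u_2 + v_2 \\ \vdots \\ u_N + v_N \end{pmatrix}. \tag{17}$$

The vector on the right side of (17) is called the sum of the vectors
u and v, and is denoted by u + v; thus we have the N relations

$$(u + v)_k = u_k + v_k, \tag{18}$$

where $(u + v)_k$ is the kth component of the vector u + v.
The rule for multiplying a vector u by a constant c is:

$$(49^{54}) \qquad c \begin{pmatrix} u_1 \\ u_2 \\ \vdots \\ u_N \end{pmatrix} = \begin{pmatrix} cu_1 \\ cu_2 \\ \vdots \\ cu_N \end{pmatrix}. \tag{19}$$

The vector on the right side of (19) is called the product of the
vector u and the constant c, and is denoted by cu; thus we have
the N relations

$$(cu)_k = cu_k, \tag{20}$$

where $(cu)_k$ is the kth component of the vector cu. The totality
of vectors of the form cu, where u is a fixed nonvanishing vector
and the constant c (which is allowed to be complex) varies from
vector to vector, is said to form a ray.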

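Finally, the rule for multiplying a vector u by a matrix $\alpha$ is:

$$(51^{54}) \qquad \begin{pmatrix} \alpha_{11} & \alpha_{12} & \cdots & \alpha_{1N} \\ \alpha_{21} & \alpha_{22} & \cdots & \alpha_{2N} \\ \vdots & \vdots & & \vdots \\ \alpha_{N1} & \alpha_{N2} & \cdots & \alpha_{NN} \end{pmatrix} \begin{pmatrix} u_1 \\ u_2 \\ \vdots \\ u_N \end{pmatrix} = \begin{pmatrix} \sum_i \alpha_{1i}u_i \\ \sum_i \alpha_{2i}u_i \\ \vdots \\ \sum_i \alpha_{Ni}u_i \end{pmatrix}, \tag{21}$$

where the summation index i runs through the values 1, 2, ..., N.
The vector on the right side of (21) is denoted by $\alpha u$; thus we
have the N relations

$$(\alpha u)_k = \sum_i \alpha_{ki}u_i, \qquad i = 1, 2, \cdots, N, \tag{22}$$

where $(\alpha u)_k$ is the kth component of the vector $\alpha u$.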

In the first column of the accompanying table we list the
fundamental symbolic equations that may hold between N-by-N
matrices $\alpha$, $\beta$, and so on, and N-vectors u, v, and so on; the second
column gives the sets of numerical equations that the respective
symbolic equations imply according to the definitions given
above; the number of numerical equations in a set is given in the
third column. The summation indices i run through the values
1, 2, ..., N; it is of course immaterial what letter, not employed
otherwise, is used for a summation index. The rules (23) to (31)
apply also when N is infinite, that is, when the matrices have an
infinite number of rows and columns and the vectors have an
infinite number of rows.



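Symbolic equation        Numerical equations                          Number
$\alpha = \beta$             $\alpha_{nm} = \beta_{nm}$                          $N^2$   (23)
$\gamma = \alpha + \beta$        $\gamma_{nm} = \alpha_{nm} + \beta_{nm}$             $N^2$   (24)
$\beta = c\alpha$            $\beta_{nm} = c\alpha_{nm}$                         $N^2$   (25)
$\gamma = \alpha\beta$           $\gamma_{nm} = \sum_i \alpha_{ni}\beta_{im}$        $N^2$   (26)
$\delta = \beta\alpha$           $\delta_{nm} = \sum_i \beta_{ni}\alpha_{im}$        $N^2$   (27)
$u = v$                  $u_k = v_k$                                  $N$     (28)
$w = u + v$              $w_k = u_k + v_k$                            $N$     (29)
$v = cu$                 $v_k = cu_k$                                 $N$     (30)
$v = \alpha u$               $v_k = \sum_i \alpha_{ki}u_i$                 $N$     (31)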


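Repeated applications of these rules lead to the identification
of more complicated combinations of matrices and vectors.
For example, in the case of the matrix

$$(\alpha\beta)\gamma \tag{32}$$

(that is, the product, in the given order, of the matrices $\alpha\beta$ and $\gamma$),
we have according to (9)

$$[(\alpha\beta)\gamma]_{nm} = \sum_i (\alpha\beta)_{ni}\gamma_{im}, \tag{33}$$

where $[(\alpha\beta)\gamma]_{nm}$ is the nmth component of $(\alpha\beta)\gamma$; now, again
according to (9), $(\alpha\beta)_{ni} = \sum_j \alpha_{nj}\beta_{ji}$, so that we finally get

$$[(\alpha\beta)\gamma]_{nm} = \sum_i \sum_j \alpha_{nj}\beta_{ji}\gamma_{im}. \tag{34}$$

Similarly, in the case of the matrix

$$\alpha(\beta\gamma) \tag{35}$$

(that is, the product, in the given order, of the matrices $\alpha$ and $\beta\gamma$)
we have

$$[\alpha(\beta\gamma)]_{nm} = \sum_i \alpha_{ni}(\beta\gamma)_{im}, \tag{36}$$

or, since $(\beta\gamma)_{im} = \sum_j \beta_{ij}\gamma_{jm}$,

$$[\alpha(\beta\gamma)]_{nm} = \sum_i \sum_j \alpha_{ni}\beta_{ij}\gamma_{jm}. \tag{37}$$

The right sides of (34) and (37) differ only in the lettering of the
summation indices, and their comparison shows that

$$(\alpha\beta)\gamma = \alpha(\beta\gamma), \tag{38}$$

and hence matrix multiplication is said to be associative (Exercise 7).
This result enables us, in particular, to use without
ambiguity the symbol $\alpha\beta\gamma$ for the product, in the given order, of
three matrices $\alpha$, $\beta$, and $\gamma$, and to write

$$\alpha\beta\gamma = (\alpha\beta)\gamma = \alpha(\beta\gamma), \tag{39}$$

it being immaterial which of the two interpretations is given to the
symbol $\alpha\beta\gamma$. Similarly, the vectors $\alpha(\beta u)$ and $(\alpha\beta)u$ are equal
to each other, so that the symbol $\alpha\beta u$ can be used without
ambiguity:

$$\alpha\beta u = \alpha(\beta u) = (\alpha\beta)u. \tag{40}$$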
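
The associativity relations (38) and (40) can be spot-checked on particular matrices. The sketch below uses hypothetical helper names and arbitrarily chosen 2-by-2 integer matrices; a check of this kind illustrates the general result but of course does not prove it.

```python
def mat_mul(a, b):
    """Matrix product according to (9)."""
    n = len(a)
    return [[sum(a[r][i] * b[i][c] for i in range(n)) for c in range(n)]
            for r in range(n)]

def mat_vec(a, u):
    """Matrix-vector product according to (22)."""
    return [sum(a[k][i] * u[i] for i in range(len(u))) for k in range(len(a))]

# Arbitrary sample data, chosen only for this illustration.
alpha = [[1, 2], [3, 4]]
beta = [[0, 1], [1, 0]]
gamma = [[2, 0], [1, 3]]
u = [5, -1]

left = mat_mul(mat_mul(alpha, beta), gamma)    # (alpha beta) gamma
right = mat_mul(alpha, mat_mul(beta, gamma))   # alpha (beta gamma)
print(left == right)                           # True, as (38) requires

# Relation (40): alpha(beta u) equals (alpha beta)u.
print(mat_vec(alpha, mat_vec(beta, u)) == mat_vec(mat_mul(alpha, beta), u))  # True
```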

The k N-vectors $u^{(1)}, u^{(2)}, \cdots, u^{(k)}$ are said to be linearly independent
if it is impossible to choose k constants $c_1, c_2, \cdots, c_k$,
not all of which are zeros, in such a way as to satisfy the equation
$c_1u^{(1)} + c_2u^{(2)} + \cdots + c_ku^{(k)} = 0$, where 0 denotes the N-vector
zero; otherwise the k vectors are said to be linearly dependent.


The matrices on the one hand and the vectors on the other are
treated in our algebra as entities of essentially different kinds;
thus, the addition of a vector to a matrix is not provided for; also,
although we have a rule for multiplying a vector from the left
by a matrix (that is, a rule for interpreting the symbol $\alpha u$), we
have no rule for multiplying a matrix from the left by a vector,
so that a symbol such as $u\alpha$ is meaningless. In fact, the distinction
that our algebra makes between matrices and vectors is
just the distinction between operators and operands. Accordingly,
in the study of the interplay between matrices and vectors
it is convenient to regard matrices as operators and vectors as their
operands, and to interpret the equation

$$(31) \qquad \alpha u = v \tag{41}$$

as meaning that the matrix $\alpha$, when operating on the vector u,
sends it into the vector v. We shall leave it as an exercise for the
reader to prove that the rules (23) to (31) are entirely consistent
with an operator-operand interpretation of matrices and vectors,
that is, that these rules define equality, sums, and products in a
manner consistent with the definitions of equality, sums, and
products which we adopted in §2 while discussing operators and
operands.

Eigenvalues and eigenvectors of matrices. If a matrix $\alpha$ and
a nonvanishing vector u are so related that

$$\alpha u = \lambda u, \tag{42}$$

where $\lambda$ is a number (that is, if u, when operated on by $\alpha$, is sent
into a numerical multiple of itself), then u is said to be an eigenvector
of the matrix $\alpha$, $\lambda$ is said to be an eigenvalue of the matrix $\alpha$,
and the eigenvector u and the eigenvalue $\lambda$ of $\alpha$ are said to belong
to each other. If u is an eigenvector of $\alpha$ belonging to the eigenvalue
$\lambda$, then the ray cu is called an eigenray of $\alpha$ belonging to the
eigenvalue $\lambda$.


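For example, if we multiply the vector $u = \begin{pmatrix} 1 \\ 2 \\ 2 \end{pmatrix}$ by the matrix

$$\rho = \begin{pmatrix} 11 & -6 & 2 \\ -6 & 10 & -4 \\ 2 & -4 & 6 \end{pmatrix}, \tag{43}$$

we get according to (21)

$$\rho u = \begin{pmatrix} 11 & -6 & 2 \\ -6 & 10 & -4 \\ 2 & -4 & 6 \end{pmatrix} \begin{pmatrix} 1 \\ 2 \\ 2 \end{pmatrix} = \begin{pmatrix} 11 \cdot 1 - 6 \cdot 2 + 2 \cdot 2 \\ -6 \cdot 1 + 10 \cdot 2 - 4 \cdot 2 \\ 2 \cdot 1 - 4 \cdot 2 + 6 \cdot 2 \end{pmatrix} = \begin{pmatrix} 3 \\ 6 \\ 6 \end{pmatrix}, \tag{44}$$

and hence

$$\rho u = 3u. \tag{45}$$

The vector $\begin{pmatrix} 1 \\ 2 \\ 2 \end{pmatrix}$ is consequently an eigenvector of the matrix $\rho$, the corresponding
eigenvalue of $\rho$ being 3; and the ray $c\begin{pmatrix} 1 \\ 2 \\ 2 \end{pmatrix}$ is the eigenray of $\rho$
belonging to the eigenvalue 3.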
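
The arithmetic of (44) and (45) can be repeated mechanically. The sketch below, with a hypothetical function name `mat_vec`, applies the rule (21) to the matrix (43) and the vector used in the example above, and also checks that an arbitrary multiple of that vector is again an eigenvector, as the definition of an eigenray requires.

```python
def mat_vec(a, u):
    """Rule (21): the kth component is the sum over i of a_ki * u_i."""
    return [sum(a[k][i] * u[i] for i in range(len(u))) for k in range(len(a))]

rho = [[11, -6, 2],
       [-6, 10, -4],
       [2, -4, 6]]       # the matrix (43)
u = [1, 2, 2]

print(mat_vec(rho, u))                         # [3, 6, 6]
print(mat_vec(rho, u) == [3 * x for x in u])   # True: rho u = 3u

# Any multiple c*u lies on the same eigenray; c = -7 is an arbitrary choice.
c = -7
cu = [c * x for x in u]
print(mat_vec(rho, cu) == [3 * x for x in cu])  # True
```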

The procedure for computing the eigenvalues of a given matrix
is as follows: According to (28) the symbolic equation (42) is an
abbreviation for the N numerical equations $(\alpha u)_k = (\lambda u)_k$, that
is [see (31) and (30)] for the N equations $\sum_i \alpha_{ki}u_i = \lambda u_k$ or
$\sum_i \alpha_{ki}u_i - \lambda u_k = 0$, which, when written out in detail, are

$$\begin{cases} (\alpha_{11} - \lambda)u_1 + \alpha_{12}u_2 + \cdots + \alpha_{1N}u_N = 0 \\ \alpha_{21}u_1 + (\alpha_{22} - \lambda)u_2 + \cdots + \alpha_{2N}u_N = 0 \\ \qquad \cdots \\ \alpha_{N1}u_1 + \alpha_{N2}u_2 + \cdots + (\alpha_{NN} - \lambda)u_N = 0. \end{cases} \tag{46}$$

In order that the system (46) have solutions other than the trivial
$u_1 = u_2 = \cdots = u_N = 0$, it is necessary and sufficient that the
determinant of the coefficients vanish, that is, that

$$\begin{vmatrix} (\alpha_{11} - \lambda) & \alpha_{12} & \cdots & \alpha_{1N} \\ \alpha_{21} & (\alpha_{22} - \lambda) & \cdots & \alpha_{2N} \\ \vdots & \vdots & & \vdots \\ \alpha_{N1} & \alpha_{N2} & \cdots & (\alpha_{NN} - \lambda) \end{vmatrix} = 0. \tag{47}$$

Now besides the preassigned quantities $\alpha_{nm}$, Equation (47)
involves only the parameter $\lambda$, so that (42) has nontrivial solutions
if and only if $\lambda$ is a root of (47). Equation (47), an equation
of the Nth degree in the parameter $\lambda$, is called the secular equation
of the matrix $\alpha$; thus the eigenvalues of a
matrix are the roots of the secular equation of the matrix. Note
that the computation of the eigenvalues of a given matrix can
be carried out directly through the secular equation (47), and
without reference to the defining equation (42).

For example, the eigenvalues of the matrix (43) are defined to be those
values of $\lambda$ for which the equation $\rho u = \lambda u$, that is,

$$\begin{pmatrix} 11 & -6 & 2 \\ -6 & 10 & -4 \\ 2 & -4 & 6 \end{pmatrix} \begin{pmatrix} u_1 \\ u_2 \\ u_3 \end{pmatrix} = \lambda \begin{pmatrix} u_1 \\ u_2 \\ u_3 \end{pmatrix}, \tag{48}$$

holds ($u_1$, $u_2$, and $u_3$, at least one of which is required to be different from
zero, are then the components of an eigenvector of $\rho$ belonging to the
eigenvalue $\lambda$ of $\rho$); however, to compute the eigenvalues of $\rho$ we need not
write down Equation (48), but may turn directly to the secular equation
(47), which in the case of the matrix $\rho$ is

$$\begin{vmatrix} (11 - \lambda) & -6 & 2 \\ -6 & (10 - \lambda) & -4 \\ 2 & -4 & (6 - \lambda) \end{vmatrix} = 0, \tag{49}$$

that is, $(11 - \lambda)(10 - \lambda)(6 - \lambda) + (-6)(-4) \cdot 2 + (-6)(-4) \cdot 2 - 2(10 - \lambda) \cdot 2 - (-6)(-6)(6 - \lambda) - (11 - \lambda)(-4)(-4) = 0$, or simply

$$-\lambda^3 + 27\lambda^2 - 180\lambda + 324 = 0. \tag{50}$$

The roots of (50) are $\lambda_1 = 3$, $\lambda_2 = 6$, and $\lambda_3 = 18$, so that the matrix $\rho$ has
just three eigenvalues: 3, 6, and 18. For no other values of $\lambda$ is it possible
to find three numbers $u_1$, $u_2$, and $u_3$ which satisfy (48) and which are
not all zero.
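
The expansion (50) and its roots can be verified by evaluating the determinant in (49) directly. In the sketch below the function names are hypothetical; the determinant is expanded by cofactors of the first row, and the integer values of $\lambda$ from 0 to 20 are scanned for zeros, which suffices here because all three roots happen to be integers in that range.

```python
def det3(m):
    """Determinant of a 3-by-3 matrix, expanded along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def secular(m, lam):
    """Left side of the secular equation: det of m with lam subtracted on the diagonal."""
    shifted = [[m[i][j] - (lam if i == j else 0) for j in range(3)]
               for i in range(3)]
    return det3(shifted)

rho = [[11, -6, 2],
       [-6, 10, -4],
       [2, -4, 6]]       # the matrix (43)

# The determinant agrees with the polynomial on the left side of (50).
print(all(secular(rho, lam) == -lam**3 + 27 * lam**2 - 180 * lam + 324
          for lam in range(-5, 25)))   # True

roots = [lam for lam in range(0, 21) if secular(rho, lam) == 0]
print(roots)                           # [3, 6, 18]
```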

The following example will illustrate the procedure for computing
the eigenrays of a matrix.

Let it be required to find the eigenray of our matrix $\rho$ belonging to the
eigenvalue 3 of $\rho$. Equation (48), when written in the detailed form (46),
yields

$$\begin{aligned} (11 - \lambda)u_1 - 6u_2 + 2u_3 &= 0 \\ -6u_1 + (10 - \lambda)u_2 - 4u_3 &= 0 \\ 2u_1 - 4u_2 + (6 - \lambda)u_3 &= 0 \end{aligned} \tag{51}$$

so that for $\lambda = 3$ we get

$$\begin{aligned} 8u_1 - 6u_2 + 2u_3 &= 0 \\ -6u_1 + 7u_2 - 4u_3 &= 0 \\ 2u_1 - 4u_2 + 3u_3 &= 0. \end{aligned} \tag{52}$$

The determinant of the coefficients of this set of three simultaneous linear
homogeneous equations in three unknowns is known to be zero because of
the special choice of $\lambda$, and hence the set has nontrivial solutions. If
$u_1$, $u_2$, and $u_3$ satisfy (52), then $au_1$, $au_2$, and $au_3$ also satisfy it, whatever
the value of the constant a may be; consequently we may ascribe an
arbitrary value to one of the u's, for example, let $u_1 = c$. Equations (52)
then become

$$\begin{aligned} -6u_2 + 2u_3 &= -8c \\ 7u_2 - 4u_3 &= 6c \\ -4u_2 + 3u_3 &= -2c, \end{aligned} \tag{53}$$

that is, a set of three linear inhomogeneous equations in two unknowns.
The consistency of these equations is assured, and we may restrict ourselves
to just two of them, say the first and the second:

$$\begin{aligned} -6u_2 + 2u_3 &= -8c \\ 7u_2 - 4u_3 &= 6c. \end{aligned} \tag{54}$$

We now have a set of two inhomogeneous equations in two unknowns.
Since the determinant of the coefficients of (54) is not zero, the set has
one and only one solution, which is readily found to be $u_2 = 2c$ and $u_3 = 2c$;
as a check, we verify that $u_2 = 2c$ and $u_3 = 2c$ satisfy the third equation
in (53). The components of an eigenvector of $\rho$ belonging to the eigenvalue
3 are consequently $u_1 = c$, $u_2 = 2c$, and $u_3 = 2c$, and the eigenray
of $\rho$ belonging to the eigenvalue 3 is

$$c \begin{pmatrix} 1 \\ 2 \\ 2 \end{pmatrix}, \tag{55}$$

that is, the ray first mentioned just below (45).
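
The solution of the pair of equations (54) can be reproduced with exact arithmetic. The sketch below sets c = 1 and solves the two equations by Cramer's rule, which is one convenient method and is not necessarily the one intended in the text; the function name is hypothetical. The third equation of (53) is then used as a check, as in the example.

```python
from fractions import Fraction

def solve_2x2(a11, a12, b1, a21, a22, b2):
    """Solve a11*x + a12*y = b1, a21*x + a22*y = b2 by Cramer's rule."""
    det = Fraction(a11 * a22 - a12 * a21)
    if det == 0:
        raise ValueError("the determinant of the coefficients vanishes")
    x = (b1 * a22 - a12 * b2) / det
    y = (a11 * b2 - b1 * a21) / det
    return x, y

c = 1
# Equations (54) with u1 = c:  -6*u2 + 2*u3 = -8c,  7*u2 - 4*u3 = 6c.
u2, u3 = solve_2x2(-6, 2, -8 * c, 7, -4, 6 * c)
print(u2, u3)                        # 2 2

# Check against the third equation of (53): -4*u2 + 3*u3 = -2c.
print(-4 * u2 + 3 * u3 == -2 * c)    # True
```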

Any two eigenvectors of our matrix $\rho$ belonging to the eigenvalue
3 differ from each other at most through a multiplicative
constant and are consequently linearly dependent; for this reason
the eigenvalue 3 of $\rho$ is said to be nondegenerate. If every n + 1
but not every n eigenvectors of a matrix $\alpha$ belonging to the
eigenvalue $\lambda$ of $\alpha$ are linearly dependent, the eigenvalue $\lambda$ is said
to be n-fold degenerate, or to be of multiplicity n; an example of
degeneracy will be given in Exercise 6. According to the theory
of simultaneous linear equations, the maximum number of
linearly independent eigenvectors that may belong to a given
eigenvalue of an N-by-N matrix is equal to the number of times
this eigenvalue occurs among the N roots of the secular equation
of the matrix. A matrix having one or more degenerate eigenvalues
is called degenerate. The totality of the eigenvalues of a
matrix is called its eigenvalue spectrum.
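
The degree of degeneracy of an eigenvalue can be found by counting the linearly independent solutions of the homogeneous system (46). The following minimal sketch does this by exact Gaussian elimination: the number of independent eigenvectors is N minus the rank of the matrix with $\lambda$ subtracted along the diagonal. The function names are assumptions made for this illustration; the test matrices are the matrix (43) and the matrix of Exercise 6 of this section.

```python
from fractions import Fraction

def rank(m):
    """Rank of a matrix, found by Gaussian elimination in exact arithmetic."""
    rows = [[Fraction(x) for x in row] for row in m]
    n_rows, n_cols = len(rows), len(rows[0])
    r = 0
    for c in range(n_cols):
        pivot = next((i for i in range(r, n_rows) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, n_rows):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r

def degeneracy(m, lam):
    """Number of linearly independent eigenvectors belonging to lam."""
    n = len(m)
    shifted = [[m[i][j] - (lam if i == j else 0) for j in range(n)]
               for i in range(n)]
    return n - rank(shifted)

rho = [[11, -6, 2], [-6, 10, -4], [2, -4, 6]]      # the matrix (43)
ex6 = [[9, 0, -2], [0, 10, 0], [-2, 0, 6]]         # the matrix of Exercise 6

print(degeneracy(rho, 3))    # 1: the eigenvalue 3 of (43) is nondegenerate
print(degeneracy(ex6, 5))    # 1
print(degeneracy(ex6, 10))   # 2: doubly degenerate
print(degeneracy(rho, 4))    # 0: 4 is not an eigenvalue of (43)
```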


Exercises 

1. Illustrate by diagrams similar to $33^{54}$ and $52^{54}$ the processes of
multiplying a matrix by a matrix and a vector by a matrix when N > 2.


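2. Show that if

$$\alpha = \begin{pmatrix} 1 & -2 & 3 & -4 \\ 4 & 3 & 2 & 1 \\ 4 & -3 & 2 & -1 \\ 1 & 2 & -3 & 4 \end{pmatrix} \quad \text{and} \quad \beta = \begin{pmatrix} 1 & 4 & -1 & -4 \\ 2 & -3 & 2 & 3 \\ 3 & 2 & -3 & -2 \\ 4 & -1 & 4 & 1 \end{pmatrix},$$

then

$$\alpha + \beta = \begin{pmatrix} 2 & 2 & 2 & -8 \\ 6 & 0 & 4 & 4 \\ 7 & -1 & -1 & -3 \\ 5 & 1 & 1 & 5 \end{pmatrix}, \qquad \alpha\beta = \begin{pmatrix} -10 & 20 & -30 & -20 \\ 20 & 10 & 0 & -10 \\ 0 & 30 & -20 & -30 \\ 12 & -12 & 28 & 12 \end{pmatrix},$$

$$\beta\alpha = \begin{pmatrix} 9 & 5 & 21 & -15 \\ 1 & -13 & -5 & -1 \\ -3 & 5 & 13 & -15 \\ 17 & -21 & 15 & -17 \end{pmatrix}, \qquad \alpha^2 = \begin{pmatrix} 1 & -25 & 17 & -25 \\ 25 & -3 & 19 & -11 \\ -1 & -25 & 13 & -25 \\ 1 & 21 & -11 & 17 \end{pmatrix},$$

$$\beta^2 = \begin{pmatrix} -10 & -6 & -6 & 6 \\ 14 & 18 & -2 & -18 \\ -10 & 2 & 2 & -2 \\ 18 & 26 & -14 & -26 \end{pmatrix}, \quad \text{and} \quad (\alpha + \beta)^2 = \begin{pmatrix} -10 & -6 & 2 & -54 \\ 60 & 12 & 12 & -40 \\ -14 & 12 & 8 & -72 \\ 48 & 14 & 18 & -14 \end{pmatrix};$$

verify that $(\alpha + \beta)^2 = \alpha^2 + \alpha\beta + \beta\alpha + \beta^2$.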


3. Let u = 


show that au = 


and let $\alpha$ and $\beta$ be the matrices so labeled in Exercise 2;


and fiu 


4. Show that the eigenrays of the matrix (43) belonging to its respective
eigenvalues 3, 6, and 18 are $c_1\begin{pmatrix} 1 \\ 2 \\ 2 \end{pmatrix}$, $c_2\begin{pmatrix} 2 \\ 1 \\ -2 \end{pmatrix}$, and $c_3\begin{pmatrix} 2 \\ -2 \\ 1 \end{pmatrix}$, where $c_1$,
$c_2$, and $c_3$ are arbitrary nonvanishing constants. Using the explicit
expressions for the eigenvectors, show that any two eigenvectors of (43)
belonging to different eigenvalues are orthogonal.

/I 0 G\ 



arc* 


‘ 3 . 


6. Show that the eigenvalues of the matrix I 0 

\6 

and 10. Compute the eigenrays of this matrix, and, using the explicit
expressions for the eigenrays, show that any two of its eigenvectors
belonging to different eigenvalues are orthogonal.

6. Show that the eigenvalues of the matrix $\begin{pmatrix} 9 & 0 & -2 \\ 0 & 10 & 0 \\ -2 & 0 & 6 \end{pmatrix}$ are 5 and
10, the first of which is nondegenerate and the second is doubly degenerate.
Show that the eigenvectors belonging to the eigenvalue 5 of this matrix
have the form $c_1\begin{pmatrix} 1 \\ 0 \\ 2 \end{pmatrix}$, and the eigenvectors belonging to the eigenvalue 10
the form $c_2\begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix} + c_3\begin{pmatrix} -2 \\ 0 \\ 1 \end{pmatrix}$, where $c_1$, $c_2$, and $c_3$ are arbitrary constants.
Verify that every three but not every two eigenvectors belonging to the
eigenvalue 10 are linearly dependent. Using the explicit expressions for
the eigenvectors, show that any two eigenvectors of this matrix belonging
to different eigenvalues are orthogonal.

7. Show that the eigenvalues of each of the three matrices 12^® are 1 

and —1; and that sample eigenvectors of tr* are and (-0 , tllOHO 


of o-y are 




and those of a, are 


C) ““(“)■ 


8. Let $\alpha$ and $\beta$ be two matrices such that $\alpha u = \beta u$ for every vector u;
show that then $\alpha = \beta$ in the sense of the definition of matrix equality
adopted in this section. Recall the definition of equality of operators
given in §2, and discuss the bearing of the present result on the operator-operand
interpretation of matrices and vectors. Consider, in the light
of the definitions of operator sum and operator product given in §2, the
definitions of matrix sum and matrix product adopted in this section.

9. Show that, if the terminology is appropriately adjusted, then the 
theorems, of Exercises 2® and 3® hold for matrices. 


57. N-Dimensional Spaces

N-by-N matrices and N-component column symbols arise, often
in disguise, in all kinds of problems in mathematics, classical
physics, and quantum mechanics; but an intuitive insight into
those of their properties which will be of special interest to us is
gained particularly readily if in discussing them we speak of them
as though they have to do with mappings of an N-dimensional
space; it is for this reason that we have already introduced some
geometric terms in the purely algebraic discussion of the last
section. Accordingly, now and then in our algebraic study of
matrices and column symbols, we shall

(a) speak of an N-dimensional space (N-space) within which
there is fixed a coordinate frame whose origin is at a point O,
whose N axes are mutually perpendicular, and whose scale unit
is the same for all axes;

(b) interpret an N-component column symbol as representing a
vector in this N-space, that is, as being a properly ordered list
of the components (relative to our coordinate frame) of an
N-dimensional vector; and

(c) interpret an N-by-N matrix as the representative (relative
to our frame) of a mapping of the N-space.

If N = 2 or N = 3 and if all the components of the matrices
and column symbols concerned are real, then the steps enumerated
above can be actually pictured in terms of a plane or the ordinary
three-dimensional space, so that our interpretation is then truly
geometric rather than quasi-geometric; if N > 3 and all components
are real, then the N-space invoked is a fictitious one and
is similar to the phase spaces used in statistical mechanics; in
general, our N-space will be complex and hence still more abstract.
Incidentally, the orientation and the scale unit of the coordinate
frame referred to in (a) will be quite immaterial for our purposes.

The algebraic meaning of the symbolic equation $\alpha u = v$ is
that the components of the column symbols u and v and of the
matrix $\alpha$ are related through the equations $31^{56}$; the quasi-geometric
interpretation of this equation is that a mapping $\alpha$
(represented by the matrix $\alpha$) of an N-space sends the vector u
(represented by the column symbol u) into the vector v (represented
by the column symbol v). In particular, the quasi-geometric
interpretation of the symbolic equation $\alpha u = \lambda u$ is that
the vector u of an N-space is an eigenvector of the mapping $\alpha$
belonging to the eigenvalue $\lambda$ of the mapping, that is, that the
mapping $\alpha$ does not remove the vector u from its original ray
but merely multiplies it by the constant $\lambda$. Thus the eigenvalues
of a matrix (that is, the roots of the secular equation of the matrix)
are the eigenvalues of a mapping that the matrix may be interpreted
to represent.

The introduction of quasi-geometrical notions is of course essentially
a matter of terminology. Thus an N-vector was defined
in §56 as an ordered single-file array of N numbers; and it will
remain just that, whether or not we try to picture it as a point
in a fictitious N-dimensional space.

Exercises 

1. Interpret the matrix $43^{56}$ as representing a mapping of a 3-space, and
investigate the figure into which this mapping sends a unit sphere with
center at O.

2. Consider along the lines of the preceding exercise the matrix of
Exercise $6^{56}$. What, in particular, is the geometric implication of the
degeneracy of this matrix?


58. Further Remarks on Matrices

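The elements $\alpha_{11}, \alpha_{22}, \cdots$, and $\alpha_{NN}$, lying on the principal
diagonal of a matrix $\alpha$, are the diagonal elements of $\alpha$; the remaining
elements are the off-diagonal elements of $\alpha$. A matrix all of
whose off-diagonal elements are zeros, that is, a matrix of the form

$$\begin{pmatrix} \alpha_{11} & 0 & \cdots & 0 \\ 0 & \alpha_{22} & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & \alpha_{NN} \end{pmatrix}, \tag{1}$$

is a diagonal matrix. The special matrices

$$\begin{pmatrix} c & 0 & \cdots & 0 \\ 0 & c & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & c \end{pmatrix}, \qquad \begin{pmatrix} 1 & 0 & \cdots & 0 \\ 0 & 1 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & 1 \end{pmatrix}, \qquad \begin{pmatrix} 0 & 0 & \cdots & 0 \\ 0 & 0 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & 0 \end{pmatrix} \tag{2}$$

are denoted by c, 1, and 0, and are called the matrix c, the unit
matrix, and the matrix zero, respectively; since $1\alpha = \alpha 1 = \alpha$,
whatever $\alpha$ may be, the symbol 1 is usually omitted when the
unit matrix is involved as a factor in a matrix product. The
sum of the diagonal elements of a matrix $\alpha$ is denoted by the
symbol tr $\alpha$ and is called the trace* of the matrix $\alpha$:

$$\operatorname{tr} \alpha = \alpha_{11} + \alpha_{22} + \cdots + \alpha_{NN}; \tag{3}$$

* The German term is Spur, meaning trace or spoor.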




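for example, if p is the matrix 43*®, we have tr p = 11 + 10 + 6 =
27. The following determinant, denoted by det $\alpha$, is called the
determinant of the matrix $\alpha$:

$$\det\alpha = \begin{vmatrix} \alpha_{11} & \alpha_{12} & \cdots & \alpha_{1N} \\ \alpha_{21} & \alpha_{22} & \cdots & \alpha_{2N} \\ \vdots & \vdots & & \vdots \\ \alpha_{N1} & \alpha_{N2} & \cdots & \alpha_{NN} \end{vmatrix}; \tag{4}$$

for example, if p is the matrix 43“ we have

$$\det p = \begin{vmatrix} 11 & -6 & 2 \\ -6 & 10 & -4 \\ 2 & -4 & 6 \end{vmatrix} = 324.$$

If det $\alpha$ = 0, the matrix $\alpha$ is said to be singular.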
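The two numerical examples above can be checked with a short sketch. It assumes only the matrix p whose rows appear in the determinant just displayed; the helper names `trace` and `det` are illustrative and not part of the text.

```python
def trace(m):
    """Sum of the diagonal elements of a square matrix (list of rows)."""
    return sum(m[i][i] for i in range(len(m)))


def det(m):
    """Determinant by expansion along the first row; adequate for small N."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, element in enumerate(m[0]):
        # Minor: delete row 0 and column j.
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * element * det(minor)
    return total


# The 3-by-3 matrix p whose rows appear in the determinant above.
p = [[11, -6, 2],
     [-6, 10, -4],
     [2, -4, 6]]

print(trace(p))  # 27
print(det(p))    # 324
```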

The determinant

$$\begin{vmatrix} (\alpha_{11} - \lambda) & \alpha_{12} & \cdots & \alpha_{1N} \\ \alpha_{21} & (\alpha_{22} - \lambda) & \cdots & \alpha_{2N} \\ \vdots & \vdots & & \vdots \\ \alpha_{N1} & \alpha_{N2} & \cdots & (\alpha_{NN} - \lambda) \end{vmatrix}, \tag{5}$$

that is, the left side of the secular equation 47“, is called the
secular determinant of the matrix $\alpha$; the secular determinant of $\alpha$
is a polynomial of the Nth degree in the parameter $\lambda$, and its
numerical value depends of course on that of $\lambda$. If $\lambda = 0$ the
secular determinant becomes det $\alpha$, and hence, when the secular
determinant is expanded in powers of $\lambda$, the term independent of $\lambda$
is just det $\alpha$; further, the terms involving $\lambda^{N}$ and $\lambda^{N-1}$ all come
from the product of the diagonal elements of the secular determinant;
and, since $(\alpha_{11} - \lambda)(\alpha_{22} - \lambda)\cdots(\alpha_{NN} - \lambda) = (-\lambda)^{N}
+ (-\lambda)^{N-1}\operatorname{tr}\alpha + \cdots$, we conclude that the secular determinant
has the form

$$(-\lambda)^{N} + (-\lambda)^{N-1}\operatorname{tr}\alpha + \cdots + \det\alpha. \tag{6}$$

The coefficients of $\lambda^{N-2}$, $\lambda^{N-3}$, and so on, in (6) are also definite
combinations of the elements of $\alpha$, but we need not consider them
in detail here.

The secular equation of $\alpha$ thus has the form

$$(-\lambda)^{N} + (-\lambda)^{N-1}\operatorname{tr}\alpha + \cdots + \det\alpha = 0. \tag{7}$$




Now, if $\lambda_1$, $\lambda_2$, $\cdots$, and $\lambda_N$ denote the N roots of (7), then (7)
can be written in the factored form

$$(\lambda_1 - \lambda)(\lambda_2 - \lambda)\cdots(\lambda_N - \lambda) = 0; \tag{8}$$

expanding the left side of (8), we get

$$(-\lambda)^{N} + (\lambda_1 + \lambda_2 + \cdots + \lambda_N)(-\lambda)^{N-1} + \cdots + \lambda_1\lambda_2\cdots\lambda_N = 0, \tag{9}$$

and comparison with (7) shows that

$$\lambda_1 + \lambda_2 + \cdots + \lambda_N = \operatorname{tr}\alpha \tag{10}$$

and

$$\lambda_1\lambda_2\cdots\lambda_N = \det\alpha. \tag{11}$$

Thus the sum and the product of the eigenvalues of a matrix are
equal, respectively, to the trace and to the determinant of the
matrix. It follows from (11) that a matrix having the eigenvalue
0 is singular, and conversely.
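As an illustration of (10) and (11), the sketch below uses the matrix p of the earlier example. The eigenvalues 3, 6, and 18 are not quoted in the text; they are an assumption that the code itself checks, by confirming that each value makes the secular determinant vanish. The function names are illustrative.

```python
def det(m):
    """Determinant by expansion along the first row; adequate for small N."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def secular_det(m, lam):
    """Secular determinant: det of m with lam subtracted from each diagonal element."""
    size = len(m)
    shifted = [[m[i][j] - (lam if i == j else 0) for j in range(size)]
               for i in range(size)]
    return det(shifted)


p = [[11, -6, 2],
     [-6, 10, -4],
     [2, -4, 6]]

# Candidate eigenvalues (an assumption, verified on the next line).
roots = [3, 6, 18]
print([secular_det(p, lam) for lam in roots])  # [0, 0, 0]

print(sum(roots))                      # 27, the trace of p
print(roots[0] * roots[1] * roots[2])  # 324, the determinant of p
```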

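The reciprocal of a matrix $\alpha$ is the matrix, denoted by $\alpha^{-1}$, which,
when multiplied into $\alpha$ from either side, yields the unit matrix
as the product:

$$\alpha\alpha^{-1} = \alpha^{-1}\alpha = 1. \tag{12}$$

The nmth element of $\alpha^{-1}$ turns out to be

$$(\alpha^{-1})_{nm} = \frac{1}{\det\alpha}\,\frac{\partial \det\alpha}{\partial \alpha_{mn}}; \tag{13}$$

the quantity $\partial \det\alpha/\partial\alpha_{mn}$ is called the cofactor of the element $\alpha_{mn}$.
For example, if

$$\alpha = \begin{pmatrix} \alpha_{11} & \alpha_{12} \\ \alpha_{21} & \alpha_{22} \end{pmatrix}, \tag{14}$$

then det $\alpha = \alpha_{11}\alpha_{22} - \alpha_{12}\alpha_{21}$, so that $\partial\det\alpha/\partial\alpha_{11} = \alpha_{22}$,
$\partial\det\alpha/\partial\alpha_{12} = -\alpha_{21}$, $\partial\det\alpha/\partial\alpha_{21} = -\alpha_{12}$, and
$\partial\det\alpha/\partial\alpha_{22} = \alpha_{11}$; hence,
writing the common factor $(\det\alpha)^{-1}$ outside, we have

$$\alpha^{-1} = \frac{1}{\alpha_{11}\alpha_{22} - \alpha_{12}\alpha_{21}} \begin{pmatrix} \alpha_{22} & -\alpha_{12} \\ -\alpha_{21} & \alpha_{11} \end{pmatrix}. \tag{15}$$

The factor $(\det\alpha)^{-1}$ in (13) shows that a singular matrix has no
reciprocal; it then follows from (11) that a matrix has no reciprocal
if it has the eigenvalue 0.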
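Formula (13) translates almost directly into code. The following is a minimal sketch rather than an efficient method: it forms each cofactor as a signed minor, uses exact fractions so that the product with the original matrix can be compared with the unit matrix without rounding, and refuses singular matrices. The function names are illustrative, and p is the matrix of the earlier example.

```python
from fractions import Fraction


def det(m):
    """Determinant by expansion along the first row; adequate for small N."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def cofactor(m, r, c):
    """Cofactor of m[r][c]: signed determinant with row r and column c removed."""
    minor = [row[:c] + row[c + 1:] for i, row in enumerate(m) if i != r]
    return (-1) ** (r + c) * det(minor)


def reciprocal(m):
    """Reciprocal by (13): element (n, m) is the cofactor of element (m, n) over det."""
    d = det(m)
    if d == 0:
        raise ValueError("a singular matrix has no reciprocal")
    size = len(m)
    return [[Fraction(cofactor(m, c, r), d) for c in range(size)]
            for r in range(size)]


def matmul(a, b):
    """Product of two square matrices of equal size."""
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


p = [[11, -6, 2], [-6, 10, -4], [2, -4, 6]]
unit = [[1 if i == j else 0 for j in range(3)] for i in range(3)]

p_inv = reciprocal(p)
print(matmul(p, p_inv) == unit)  # True
print(matmul(p_inv, p) == unit)  # True
```

For a 2-by-2 matrix the same function reproduces (15); for instance, the reciprocal of the matrix with rows (2, 1) and (5, 3), whose determinant is 1, comes out with rows (3, -1) and (-5, 2).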





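The matrix

$$\begin{pmatrix} \alpha_{11} & \alpha_{21} & \cdots & \alpha_{N1} \\ \alpha_{12} & \alpha_{22} & \cdots & \alpha_{N2} \\ \vdots & \vdots & & \vdots \\ \alpha_{1N} & \alpha_{2N} & \cdots & \alpha_{NN} \end{pmatrix}, \tag{16}$$

where $\alpha_{nm}$ is the nmth element of the matrix $\alpha$, is the transpose
of the matrix $\alpha$; for example,

$$\text{if } \alpha = \begin{pmatrix} 1 & 2+i & 3-2i \\ 4 & 5-i & 6+3i \\ 7 & 2 & 9 \end{pmatrix}, \text{ then } \tilde{\alpha} = \begin{pmatrix} 1 & 4 & 7 \\ 2+i & 5-i & 2 \\ 3-2i & 6+3i & 9 \end{pmatrix}. \tag{17}$$

Denoting the nmth element of $\tilde{\alpha}$ by $\tilde{\alpha}_{nm}$, we have the $N^2$ relations

$$\tilde{\alpha}_{nm} = \alpha_{mn}. \tag{18}$$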

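The matrix

$$\begin{pmatrix} \alpha^{*}_{11} & \alpha^{*}_{21} & \cdots & \alpha^{*}_{N1} \\ \alpha^{*}_{12} & \alpha^{*}_{22} & \cdots & \alpha^{*}_{N2} \\ \vdots & \vdots & & \vdots \\ \alpha^{*}_{1N} & \alpha^{*}_{2N} & \cdots & \alpha^{*}_{NN} \end{pmatrix}, \tag{19}$$

where $\alpha^{*}_{nm}$ is the complex conjugate of the nmth element of the
matrix $\alpha$, is the complex conjugate of the matrix $\alpha$; for example,

$$\text{if } \alpha = \begin{pmatrix} 1 & 2+i & 3-2i \\ 4 & 5-i & 6+3i \\ 7 & 2 & 9 \end{pmatrix}, \text{ then } \bar{\alpha} = \begin{pmatrix} 1 & 4 & 7 \\ 2-i & 5+i & 2 \\ 3+2i & 6-3i & 9 \end{pmatrix}. \tag{20}$$

Denoting the nmth element of $\bar{\alpha}$ by $\bar{\alpha}_{nm}$, we have the $N^2$ relations

$$\bar{\alpha}_{nm} = \alpha^{*}_{mn}. \tag{21}$$

Note that the computation of $\bar{\alpha}$ involves a transposition of rows
and columns of $\alpha$, so that, usually, $\bar{\alpha}$ is not the matrix whose
elements are the complex conjugates of the corresponding elements
of $\alpha$.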

If

$$\bar{\alpha} = \alpha, \tag{22}$$

that is, if

$$\alpha_{nm} = \alpha^{*}_{mn}, \tag{23}$$

the matrix $\alpha$ is said to be Hermitian; the matrix 43**, the matrices
12®®, and the matrix $\begin{pmatrix} 5 & 3+2i \\ 3-2i & \ldots \end{pmatrix}$ are examples of Hermitian
matrices. According to (23) the diagonal elements of a Hermitian
matrix are all real. We shall show in §69 that every eigenvalue of a
Hermitian matrix is real, and that any two eigenvectors of a Hermitian
matrix belonging to different eigenvalues of the matrix are
orthogonal; for the present the correctness of these assertions will
be taken for granted.
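The distinction between the transpose, the complex conjugate of a matrix in the sense used here (transposition together with conjugation of each element), and the Hermitian property can be made concrete with a small sketch. The matrix `a` is the one used in the examples above; the 2-by-2 matrix `h` is a hypothetical Hermitian example introduced only for this illustration, and the function names are likewise illustrative.

```python
def transpose(m):
    """Interchange rows and columns."""
    return [list(column) for column in zip(*m)]


def conjugate_matrix(m):
    """Complex conjugate in the text's sense: transpose, then conjugate each element."""
    return [[complex(x).conjugate() for x in row] for row in transpose(m)]


def is_hermitian(m):
    """True if the matrix equals its own complex conjugate (transposition included)."""
    return conjugate_matrix(m) == [[complex(x) for x in row] for row in m]


a = [[1, 2 + 1j, 3 - 2j],
     [4, 5 - 1j, 6 + 3j],
     [7, 2, 9]]

print(transpose(a) == [[1, 4, 7], [2 + 1j, 5 - 1j, 2], [3 - 2j, 6 + 3j, 9]])         # True
print(conjugate_matrix(a) == [[1, 4, 7], [2 - 1j, 5 + 1j, 2], [3 + 2j, 6 - 3j, 9]])  # True
print(is_hermitian(a))                                                               # False

# Hypothetical Hermitian example: real diagonal, mutually conjugate off-diagonal elements.
h = [[2, 1j],
     [-1j, 3]]
print(is_hermitian(h))  # True
```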

If a vector u is simultaneously an eigenvector of two matrices
$\alpha$ and $\beta$, that is, if $\alpha u = \lambda u$ and $\beta u = \mu u$, where $\lambda$ and $\mu$ are
constants, then u is a simultaneous eigenvector of $\alpha$ and $\beta$. If $\alpha\beta = \beta\alpha$,
the matrices $\alpha$ and $\beta$ commute; if $\alpha\beta = -\beta\alpha$, the matrices $\alpha$ and $\beta$
anticommute; the matrix $\alpha\beta - \beta\alpha$ is the commutator of $\alpha$ and $\beta$.

Exercises 

1. One of the eigenvalues of a 3-by-3 matrix $\alpha$ is 6, tr $\alpha$ is 12, and det $\alpha$
is 60; find the remaining eigenvalues of $\alpha$.

2. Show that det $(\alpha\beta)$ = det $(\beta\alpha)$ = (det $\alpha$)(det $\beta$), and that
det $(c\alpha) = c^{N}$ det $\alpha$.

3. Show that the eigenvalues of a diagonal matrix are its diagonal
elements.

4. Use (13) to compute $\alpha^{-1}$ when $\alpha$ is a 3-by-3 matrix with unspecified
elements $\alpha_{11}$, $\alpha_{12}$, and so on; check the result by matrix multiplication.

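5. Show that

$$(\alpha\beta)^{-1} = \beta^{-1}\alpha^{-1}, \qquad \widetilde{\alpha\beta} = \tilde{\beta}\tilde{\alpha}, \qquad \overline{\alpha\beta} = \bar{\beta}\bar{\alpha}; \tag{24}$$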


reciprocal, the transpose, and the complex


( 18 aM)


$$(\xi u)_1 = \xi_{11}u_1 + \xi_{12}u_2, \qquad (\xi u)_2 = \xi_{21}u_1 + \xi_{22}u_2 \tag{25}$$


which, before the mapping is carried out, coincide





and $\xi U_2$; the latter vectors in general no longer coincide with $e_1$ and $e_2$,
which, being fixed with respect to the coordinate frame, are not affected
by the mapping. Show that the vectors $\xi U_1$, $\xi U_2$, $U_1$, and $U_2$ satisfy
the equations

$$\xi U_1 = \tilde{\xi}_{11}U_1 + \tilde{\xi}_{12}U_2, \qquad \xi U_2 = \tilde{\xi}_{21}U_1 + \tilde{\xi}_{22}U_2. \tag{26}$$

8. Show, adjusting the terminology whenever necessary, that the 
theorems of Exercises 6® and 3^, and the last theorem of §6 hold for matrices 
and vectors in general; and that those of Exercises 2^^ and 3*^ hold for 
Hermitian matrices. 

9. Use the quasi-geometric interpretation of matrices as mappings to 
convince yourself that (a) a matrix having the eigenvalue 0 can have no 
reciprocal, (b) the eigenvalues of a diagonal matrix are its diagonal ele- 
ments, and (c) the eigenvectors of a diagonal matrix belonging to distinct 
eigenvalues of the matrix are mutually orthogonal. 

10. Let $\alpha\beta = \beta\alpha$, where $\alpha$ is a nondegenerate diagonal matrix; show that
then $\beta$ is also diagonal.

11. Write down a singular 2-by-2 matrix with simple nonvanishing
numerical elements; interpret it as representing a mapping of a plane and
construct the figure into which the mapping sends the unit circle.

12. Explain why the following argument is fallacious: 'According to a 

result of Exercise 4®*, the matrix listed in 15®* satisfies the equation 
(1 -I- 7^)(1 -f y^) = 2(1 -f 7^); multiplying both sides of this equa- 
tion on the right by the reciprocal of the matrix 1 -f 7^, we get 
(1 -f 7^)(1 + 4- 7"*)-^ - 2(1 + 7^)(1 + that is, 14*7'’ « 2; 

hence 7^ « 1,^' 

13. Show that every two of the three matrices 12®® anticommute with 
each other; that every two of the four matrices 10®^ anticommute with 
each other. 

59. Function Space

We have already seen that mapping operators can be represented
by matrices; our next task is to show that differential operators
can also be represented by matrices. As a preliminary step we
shall show that differential operators can be interpreted as mappings
of a certain space, called function space.*

To introduce the notion of a function space we begin with a
simple case.† Let $f_1 = f_1(x)$ and $f_2 = f_2(x)$, to which we shall
refer as the basic f's, be two real functions of x which are orthonormal
in an interval (a, b), so that

$$\int_a^b f_1^2\,dx = \int_a^b f_2^2\,dx = 1, \qquad \int_a^b f_1 f_2\,dx = 0, \tag{1}$$

and let us restrict ourselves to real functions u = u(x) which are
linear combinations of $f_1$ and $f_2$, and thus have the form

$$u = c_1 f_1 + c_2 f_2, \tag{2}$$

where $c_1$ and $c_2$ are arbitrary real constants. To specify any
particular u, it is necessary and sufficient to quote two specific
real numbers $c_1$ and $c_2$; that is, a particular u is specified in the
same manner as is a particular point in a real plane. This suggests
that we may associate each of our u's with a vector in a
plane by drawing, as in Fig. 93, a rectangular coordinate frame,
associating the basic f's with the basic unit vectors of
this frame, and associating a function u
with the vector whose components are
$c_1$ and $c_2$. Note that if w = u + v, where
u and v are both real linear combinations
of $f_1$ and $f_2$, then the vector associated
with w is the sum, in the usual sense, of
the vectors associated, respectively, with the functions u and v.
We call the plane shown in Fig. 93 the function plane of the
functions (2), because each vector in this plane can be associated
with a particular function of type (2), and conversely.

* Our use of the term function space will be somewhat unprecise. See
function space and Hilbert space in Kemble.

† D. Jackson, Am. Math. Monthly, 31, 461 (1924).

For example, if the basic f's are

$$f_1 = N \sin x, \qquad f_2 = N \sin 2x, \tag{3}$$

where $N = \pi^{-1/2}$, then every function u of the form

$$u = c_1 f_1 + c_2 f_2 = c_1 N \sin x + c_2 N \sin 2x, \tag{4}$$

where the c's are real, can be associated with a vector in the plane
of Fig. 94; the plane of Fig. 94 is then
the function plane of functions of
the form (4). An interval over
which the f's are orthonormal in
this case is $(-\pi, \pi)$.
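The orthonormality of the two basic functions (3) over the interval from -π to π can be checked numerically. The sketch below assumes the normalizing factor N equal to π to the power -1/2, which is the value that makes the integral of the square of each function equal to 1; the midpoint-rule integrator and all function names are illustrative choices, not part of the text.

```python
import math

N = 1 / math.sqrt(math.pi)  # normalizing factor for sin x and sin 2x on (-pi, pi)


def f1(x):
    return N * math.sin(x)


def f2(x):
    return N * math.sin(2 * x)


def integrate(g, a, b, steps=2000):
    """Composite midpoint rule for the integral of g from a to b."""
    h = (b - a) / steps
    return h * sum(g(a + (i + 0.5) * h) for i in range(steps))


def inner(u, v, a=-math.pi, b=math.pi):
    """Integral of the product u*v over (a, b), for real functions."""
    return integrate(lambda x: u(x) * v(x), a, b)


print(abs(inner(f1, f1) - 1) < 1e-9)  # True: f1 is normalized
print(abs(inner(f2, f2) - 1) < 1e-9)  # True: f2 is normalized
print(abs(inner(f1, f2)) < 1e-9)      # True: f1 and f2 are orthogonal
```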

The various geometric quantities pertaining to a function
plane stand in a one-to-one correspondence with certain analytic
expressions pertaining to the
functions u. For example, according to Fig. 93, the square of
the length of the vector u is $c_1^2 + c_2^2$; now, $\int u^2\,dx = \int (c_1 f_1 +
c_2 f_2)^2\,dx = c_1^2 \int f_1^2\,dx + 2c_1c_2 \int f_1 f_2\,dx + c_2^2 \int f_2^2\,dx$, so that in view
of (1) we have $\int_a^b u^2\,dx = c_1^2 + c_2^2$, and hence

$$\int_a^b u^2\,dx = \text{square of the length of the vector } u. \tag{5}$$

We speak of the length of the vector
associated with a function u simply as
the length of the function u.

Angles in our function plane also have
a direct analytical meaning. Indeed,
according to Fig. 95, in which both u
and v have the form (2),

$$OA^2 + OB^2 - 2\,OA\cdot OB\cos\theta = AB^2; \tag{6}$$

now, in view of (5), $OA^2 = \int_a^b u^2\,dx$, $OB^2 = \int_a^b v^2\,dx$, and $AB^2 =
\int_a^b (u - v)^2\,dx = \int_a^b u^2\,dx + \int_a^b v^2\,dx - 2\int_a^b uv\,dx$; substituting
these integrals into (6) and solving for $\cos\theta$ we get

$$\cos\theta = \frac{\int_a^b uv\,dx}{\sqrt{\int_a^b u^2\,dx \int_a^b v^2\,dx}}. \tag{7}$$


We speak of the angle between the respective vectors associated
with two functions u and v simply as the angle between the functions
u and v. If

$$\int_a^b uv\,dx = 0, \tag{8}$$

then, according to (7), the vectors associated with u and v are
mutually perpendicular; perhaps the term orthogonal, which we
used all along for real functions satisfying (8), will now appear
particularly appropriate.
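Formulas (5) and (7) can be tried on a concrete pair of functions. In the sketch below, which again takes the basic functions (3) with N equal to π to the power -1/2, the function u has components (3, 4) and v has components (1, 0), so the geometric picture predicts a length of 5 for u and a cosine of 3/5 for the angle between u and v. The integrator and the function names are illustrative.

```python
import math

N = 1 / math.sqrt(math.pi)


def f1(x):
    return N * math.sin(x)


def f2(x):
    return N * math.sin(2 * x)


def integrate(g, a, b, steps=2000):
    """Composite midpoint rule for the integral of g from a to b."""
    h = (b - a) / steps
    return h * sum(g(a + (i + 0.5) * h) for i in range(steps))


def inner(u, v):
    """Integral of u*v over (-pi, pi), for real functions."""
    return integrate(lambda x: u(x) * v(x), -math.pi, math.pi)


def length(u):
    """Length of the function u according to (5)."""
    return math.sqrt(inner(u, u))


def cos_angle(u, v):
    """Cosine of the angle between the functions u and v according to (7)."""
    return inner(u, v) / math.sqrt(inner(u, u) * inner(v, v))


def u(x):
    return 3 * f1(x) + 4 * f2(x)  # components (3, 4) in the function plane


v = f1                            # components (1, 0)

print(round(length(u), 6))        # 5.0
print(round(cos_angle(u, v), 6))  # 0.6
```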



If $f_1$, $f_2$, and $f_3$ are three real functions of x that are orthonormal
in an interval (a, b), then the functions

$$u = c_1 f_1 + c_2 f_2 + c_3 f_3, \tag{9}$$

where the c's are arbitrary real constants, can be associated with
vectors in a real three-dimensional space, the three basic f's being
now associated with the three unit vectors of a rectangular coordinate
frame. The formulas (5) and (7) carry over directly to
this case.

More generally, we may be concerned with functions of the form

$$u = c_1 f_1 + c_2 f_2 + \cdots + c_N f_N, \tag{10}$$

where the f's are real functions of x that are orthonormal in an
interval (a, b). In this case we can still speak of the u's as being
associated with vectors in a certain space; but the space now has N
dimensions, and our interpretation is quasi-geometric. The
formulas (5) and (7) are taken over for this case without change.

A further generalization lies in allowing N in (10) to be infinite,
so that the u's are infinite series:*

$$u = c_1 f_1 + c_2 f_2 + \cdots + c_k f_k + \cdots. \tag{11}$$

This case is of particular interest, since it arises in connection
with expansions of real functions in terms of infinite orthonormal
sets of real functions, such as the trigonometric functions (Fourier
series), the Hermite functions (Gram-Charlier series), and so on.
The function space now has an infinite (though countable) number
of dimensions.


The final generalization to be considered at present is that when
the basic f's and the c's in (11) are allowed to be complex, so that
the u's under consideration are, in general, complex functions of x.
The function space now involved is a complex space of infinitely
many dimensions, and is therefore very much of an abstraction.
The formula (5) now needs modification and becomes

$$\int \bar{u}u\,dx = \text{square of the length of the vector } u. \tag{12}$$


venient tha/t^^om notation of (11) is somewhat more con- 





The formula (7) for the cosine of the angle between two functions
now also needs modification, which, however, we need not consider
here; suffice it to state that the condition for the orthogonality
of the vectors associated with two functions u and v is

(3®) $$\int \bar{u}v\,dx = 0 \tag{13}$$

rather than (8).

To summarize: When two real functions of x, orthonormal in
an interval (a, b), are given, then each function u that is a real
linear combination of these two functions can be associated with
a vector in a plane called a function plane of the u's; the association
is readily visualizable, and to the geometrical elements of the plane,
such as lengths and angles, there correspond simple analytical
expressions. The situation is no more complicated when all functions
u under consideration are real linear combinations of three
real orthonormal functions; the 'function space' then has three
dimensions, but is still directly visualizable. Quasi-geometric
abstraction begins when we consider functions that are real linear
combinations of more than three (in particular, infinitely many)
orthonormal functions, or when we deal with complex functions;
a direct intuitive grasp of the entire function space is then no
longer possible; the notion of a function space, nevertheless, is
still useful because it enables us to employ geometric terminology
and geometric analogy in a rather suggestive way.

Mappings of function space. The concept of function space
enables us under certain circumstances to interpret differential
operators as mappings. We begin once again with an example.

Suppose that we are investigating the effect of the particular operator
$-d^2/dx^2$ (which we shall denote by $\xi$) on the particular functions (4).
We have

$$\xi u = -\frac{d^2}{dx^2}u = -\frac{d^2}{dx^2}(c_1 N \sin x + c_2 N \sin 2x) = c_1 N \sin x + 4c_2 N \sin 2x. \tag{14}$$

Thus $\xi u$ is again a real linear combination of the functions (3), and hence
the functions $\xi u$, as well as the functions u, can be associated with vectors
in the plane of Fig. 94. The respective vectors associated with u and with
$\xi u$ are indicated in Fig. 96, where the f's stand for the particular functions
(3), u stands for a function of the particular form (4), and $\xi$ stands for
the particular operator $-d^2/dx^2$. We can therefore interpret the process
of operating with $-d^2/dx^2$ on our u's as a mapping of a function plane;
the ellipse into which this mapping sends the unit circle is shown in Fig. 97,
together with the unit circle.

Fig. 96.

Fig. 97.
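In terms of components, (14) says that the mapping sends the vector with components c1 and c2 into the vector with components c1 and 4 c2. A brief sketch, with illustrative names only, confirms that points of the unit circle then land on an ellipse with semi-axes 1 and 4:

```python
import math


def xi(c1, c2):
    """Effect of -d^2/dx^2 on the components of c1*f1 + c2*f2, as read off from (14)."""
    return (c1, 4 * c2)


# Twelve points on the unit circle of the function plane.
circle = [(math.cos(2 * math.pi * k / 12), math.sin(2 * math.pi * k / 12))
          for k in range(12)]
image = [xi(c1, c2) for c1, c2 in circle]

# Every image point (x, y) satisfies x^2 + (y/4)^2 = 1.
on_ellipse = all(abs(x ** 2 + (y / 4) ** 2 - 1) < 1e-12 for x, y in image)
print(on_ellipse)  # True
```

The vectors along the two coordinate axes keep their directions under this mapping, being multiplied by 1 and by 4, respectively.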


In general, let the functions u under consideration have the
form

(11) $$u = c_1 f_1 + c_2 f_2 + \cdots + c_k f_k + \cdots, \tag{15}$$

where the c's are constants and where the f's are orthonormal in
some interval; and let an operator $\xi$ be such that for an arbitrary u
of the form (15) we have

$$\xi u = d_1 f_1 + d_2 f_2 + \cdots + d_k f_k + \cdots, \tag{16}$$

where the d's are constants, and the f's are the same as in (15).
The function space that 'accommodates' the u's will then also
accommodate the functions $\xi u$; and hence the effect of the operator
$\xi$ on the functions u may be pictured as a mapping of a function
space.

When a function space is used, it is sometimes convenient to
represent a function by a column symbol whose elements are the
'components' of the function relative to the coordinate frame of
the function space. The functions (2), for example, can be
written in the form

$$u = \begin{pmatrix} c_1 \\ c_2 \end{pmatrix}, \tag{17}$$

the basic functions $f_1$ and $f_2$ being kept in mind. Had we used
the symbols $u_1$ and $u_2$ for the numbers $c_1$ and $c_2$ in (2), the notation
of (17) would have been precisely like that of 42^^.

Exercises 

1. Use a function plane to show geometrically that, if the functions $u_1$
and $u_2$ are orthonormal in an interval (a, b), then the functions $v_1 =
(u_1 + u_2)/\sqrt{2}$ and $v_2 = (u_1 - u_2)/\sqrt{2}$ are also orthonormal in (a, b). Verify
this result analytically, allowing $u_1$ and $u_2$ to be complex.

2. Prove (5) and (7) when u and v have the form (9).

3. Let $u_1$, $u_2$, and $u_3$ be real functions, orthonormal in $(-\infty, \infty)$.
Verify the equation $\int (u_1 + \tfrac{1}{2}u_2 + \tfrac{1}{2}u_3)^2\,dx = \tfrac{3}{2}$ and correlate it with the
fact that the distance between a vertex of a unit cube and the center of a
nonadjacent face is $\sqrt{3/2}$. Verify the equation

$$\left[\int (u_1 + u_2)(u_1 + u_3)\,dx\right]\left[\int (u_1 + u_2)^2\,dx\right]^{-1/2}\left[\int (u_1 + u_3)^2\,dx\right]^{-1/2} = \tfrac{1}{2}$$

and correlate it with the fact that the face diagonals of a cube, passing
through a common vertex, make an angle of 60° with each other.

4. Discuss the significance of the fact that the basic f's do not appear
in (5), (7), (12), and (13).

5. Let $u = \sum_k c_k f_k$ be the expansion of a real function u in terms of an
orthonormal set of real functions; express $c_k$ as an integral and show, in
the light of (7) and (5), that $c_k$ may be properly called the kth component
of u.

6. Discuss the relation between the eigenvectors of the mapping of
Fig. 97 and the eigenfunctions [having the form (4)] of the operator
$-d^2/dx^2$; do the same for the eigenvalues.

7. Convince yourself on quasi-geometric grounds that, if an operator $\xi$
satisfying (16) is interpreted as a mapping of a function space, then the
eigenfunctions of $\xi$ having the form (15) are associated with the eigenvectors
of the mapping; consider the eigenvalues of the operator and of the
mapping in a similar light.

8. Show that the operator d/dx cannot be interpreted as a mapping
of the plane of Fig. 94. Use two functions from the set 25® to construct
a function plane in terms of which the operator d/dx can be interpreted
as a mapping; what conclusion regarding the eigenvalues of d/dx can be
drawn from the geometry of this mapping?





60. Representation of Differential Operators by Means of 
Matrices 


As we have just seen, a differential operator can under certain 
circumstances be interpreted as a mapping. This mapping can 
in turn be represented by a matrix; and the matrix so obtained is 
said to represent the differential operator. 

For example, insofar as the effect of the operator $-d^2/dx^2$ on functions
of type 4⁵⁹ is concerned, this operator can be interpreted as a mapping
of a plane; in fact, as the mapping pictured in Fig. 97. Now, inspection
of Fig. 96 shows that, in terms of the coordinate frame given in the figure,
the mapping in question is represented by the matrix

$$\begin{pmatrix} 1 & 0 \\ 0 & 4 \end{pmatrix}. \tag{1}$$

This matrix is said to represent the operator $-d^2/dx^2$ insofar as the effect
of the latter on functions of type 4⁵⁹ is concerned.

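The matrix (1) can be found also by the following systematic method:
If $\xi$ is a mapping of a plane, and $U_1$ and $U_2$ are the vectors which, before
the mapping is carried out, coincide with the unit vectors of the coordinate
frame, then

(26⁵⁸) $$\xi U_1 = \tilde{\xi}_{11}U_1 + \tilde{\xi}_{12}U_2, \qquad \xi U_2 = \tilde{\xi}_{21}U_1 + \tilde{\xi}_{22}U_2, \tag{2}$$

where $\tilde{\xi}_{nm}$ are the elements of the transpose of the matrix representing the
mapping. Now, the vectors coinciding with the unit vectors of the coordinate
frame of our function plane are $f_1$ and $f_2$, so that we may rewrite
(2) as

$$\xi f_1 = \tilde{\xi}_{11}f_1 + \tilde{\xi}_{12}f_2, \qquad \xi f_2 = \tilde{\xi}_{21}f_1 + \tilde{\xi}_{22}f_2, \tag{3}$$

and the elements of the matrix we are seeking are the coefficients in the
expansions of the functions $\xi f_1$ and $\xi f_2$ in terms of the f's. Since the f's
are orthonormal in some interval
(a, b), the expansion coefficients can be evaluated by means of 21»; in fact,
$\tilde{\xi}_{11} = \int_a^b \bar{f}_1\,\xi f_1\,dx$, $\tilde{\xi}_{12} = \int_a^b \bar{f}_2\,\xi f_1\,dx$, $\tilde{\xi}_{21} = \int_a^b \bar{f}_1\,\xi f_2\,dx$, and
$\tilde{\xi}_{22} = \int_a^b \bar{f}_2\,\xi f_2\,dx$.
These four relations can be summarized by writing
$\tilde{\xi}_{nm} = \int_a^b \bar{f}_m\,\xi f_n\,dx$, and consequently

$$\xi_{nm} = \int_a^b \bar{f}_n\,\xi f_m\,dx. \tag{4}$$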




In our special case, $\xi$ is $-d^2/dx^2$, the f's are given by 3⁵⁹, and the interval
(a, b) can be taken as $(-\pi, \pi)$. Substituting into (4), we then get, for
example, $\xi_{11} = \int \bar{f}_1\,\xi f_1\,dx = \int_{-\pi}^{\pi} N \sin x\,(-d^2/dx^2)\,N \sin x\,dx = 1$, in
agreement with (1). The reader should verify that the three remaining
elements of (1) are also given correctly by the formula (4).
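The formula (4) lends itself to a direct numerical check for this special case. The sketch below is an approximation under stated assumptions: the second derivative is replaced by a central difference with a small step, the integrals are evaluated by the midpoint rule over the interval from -π to π, and N is taken as π to the power -1/2. All names are illustrative.

```python
import math

N = 1 / math.sqrt(math.pi)
basis = [lambda x: N * math.sin(x),       # f1
         lambda x: N * math.sin(2 * x)]   # f2


def minus_second_derivative(f, h=1e-3):
    """Return a function approximating -f'' by a central difference of step h."""
    return lambda x: -(f(x + h) - 2 * f(x) + f(x - h)) / (h * h)


def integrate(g, a, b, steps=2000):
    """Composite midpoint rule for the integral of g from a to b."""
    step = (b - a) / steps
    return step * sum(g(a + (i + 0.5) * step) for i in range(steps))


def matrix_element(n, m):
    """Element (n, m) by formula (4); the basic functions are real, so no conjugation."""
    xi_fm = minus_second_derivative(basis[m])
    return integrate(lambda x: basis[n](x) * xi_fm(x), -math.pi, math.pi)


xi_matrix = [[matrix_element(n, m) for m in range(2)] for n in range(2)]

# The diagonal elements come out close to 1 and 4, the off-diagonal ones close to 0.
for row in xi_matrix:
    print(row)
```

The small departures from the exact values 1, 0, 0, 4 come from the finite-difference step, not from the formula.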


In dealing with the problem of representing differential operators
by means of matrices, we shall be concerned with the case
when the functions u under consideration have the form

(15⁵⁹) $$u = c_1 f_1 + c_2 f_2 + \cdots + c_k f_k + \cdots, \tag{5}$$

where the c's are constants and the f's are orthonormal in an
interval (a, b), and when an operator $\xi$ is such that for an arbitrary
u of the form (5) we have

(16⁵⁹) $$\xi u = d_1 f_1 + d_2 f_2 + \cdots + d_k f_k + \cdots, \tag{6}$$

where the f's are the same as in (5) and the d's are constants.
As pointed out in §59, the effect of $\xi$ on the u's can in this case
be interpreted as a mapping of a function space, and our problem
is to compute the matrix representing this mapping when the
unit vectors of the coordinate frame of the function space are the
vectors associated with the basic f's. The reader will perhaps
readily convince himself that quasi-geometric reasoning, patterned
after the geometric reasoning of the preceding example,
leads to the conclusion that the nmth element of the matrix $\boldsymbol{\xi}$
(representing the operator $\xi$) is again given by (4). The matrix
now has an infinite number of rows and columns. In the sequel
we shall usually deal with operators $\xi$, functions u, and sets of
basic f's satisfying the four following conditions: the f's are
orthonormal in the infinite interval, so that (4) becomes

$$\xi_{nm} = \int_{-\infty}^{\infty} \bar{f}_n\,\xi f_m\,dx; \tag{7}$$

the f's form a set that is complete with respect to normalizable
functions; the u's are normalizable; and the functions $\xi u$ are also
normalizable.

In an exposition not employing quasi-geometric inferences,
Equation (4) would be written down as the definition of the matrix
$\boldsymbol{\xi}$ representing the differential operator $\xi$, and Equation (7) would
be regarded as the special case of this definition, referring to basic
f's that are orthonormal in the infinite interval.

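Quasi-geometric reasoning suggests that if, using a specific set
of basic f's, we compute the matrices representing, respectively,
the three differential operators $\alpha$, $\beta$, and $\gamma = \alpha + \beta$, then the
matrix representing $\gamma$ should be the sum of the matrices representing
$\alpha$ and $\beta$. This surmise is readily verified analytically:
denoting the three matrices by $\boldsymbol{\alpha}$, $\boldsymbol{\beta}$, and $\boldsymbol{\gamma}$, and recalling 20**,
we have according to (7)

$$\gamma_{nm} = \int \bar{f}_n\,\gamma f_m\,dx = \int \bar{f}_n(\alpha + \beta)f_m\,dx = \int \bar{f}_n\,\alpha f_m\,dx + \int \bar{f}_n\,\beta f_m\,dx, \tag{8}$$

and it follows that $\gamma_{nm} = \alpha_{nm} + \beta_{nm}$, that is, that the matrices
satisfy the equation $\boldsymbol{\gamma} = \boldsymbol{\alpha} + \boldsymbol{\beta}$.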

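Similarly (as quasi-geometric reasoning again suggests) the
matrix representing the differential operator $\gamma = \alpha\beta$ is the product
$\boldsymbol{\alpha}\boldsymbol{\beta}$ of the matrices $\boldsymbol{\alpha}$ and $\boldsymbol{\beta}$ representing the operators $\alpha$ and $\beta$.
Indeed, according to (7), the nmth element of the matrix representing
$\gamma$ is

$$\gamma_{nm} = \int \bar{f}_n\,\alpha\beta f_m\,dx; \tag{9}$$

expanding $\beta f_m$ in terms of the f's, we get

$$\beta f_m = \sum_i f_i \int \bar{f}_i\,\beta f_m\,dx = \sum_i \beta_{im} f_i, \tag{10}$$

where the $\beta_{im}$ are elements of the matrix representing the
operator $\beta$; hence

$$\alpha\beta f_m = \sum_i \beta_{im}\,\alpha f_i, \tag{11}$$

and (9) can be written as

$$\gamma_{nm} = \sum_i \beta_{im} \int \bar{f}_n\,\alpha f_i\,dx. \tag{12}$$

The integral in (12) is the nith element of the
matrix representing the operator $\alpha$; and, since the elements of
matrices are numbers, the last equation can
be written as $\gamma_{nm} = \sum_i \alpha_{ni}\beta_{im}$, so that the equation
$\boldsymbol{\gamma} = \boldsymbol{\alpha}\boldsymbol{\beta}$ has been proved. This theorem and
the one preceding it can be combined into the following theorem:
an algebraic relation that holds between two or more differential
operators holds also between the
matrices representing these operators, provided all the matrices are
computed with reference to the same set of basic f's.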




One of the main reasons for our interest in the subject of representation
of differential operators by means of matrices is that,
whenever every eigenfunction of a differential operator $\xi$ can be
expanded in terms of the basic f's, every eigenvalue of the differential
operator $\xi$ is also an eigenvalue of the matrix representing $\xi$;
hence a search for the eigenvalues of a differential operator may
be replaced by a search for the eigenvalues of a matrix representing
this operator. The proof runs as follows: Let u be an eigenfunction
of $\xi$ belonging to the eigenvalue $\lambda$, so that

$$\xi u = \lambda u; \tag{13}$$

let $\boldsymbol{\xi}$ denote for the moment the matrix representing $\xi$, so that

$$\xi_{ki} = \int \bar{f}_k\,\xi f_i\,dx; \tag{14}$$

and let $\mathbf{u}$ denote for the moment the vector (column symbol)
with components

$$u_i = \int \bar{f}_i\,u\,dx, \tag{15}$$

which are infinitely many in number.* Expanding u in terms of
the basic f's, we have $u = \sum_i c_i f_i$, where $c_i = \int \bar{f}_i u\,dx$, that is,
$c_i = u_i$, so that (13) can be written as

$$\xi \sum_i u_i f_i = \lambda \sum_i u_i f_i. \tag{16}$$

Multiplying (16) through by $\bar{f}_k$ and integrating, we get
$\int \bar{f}_k\,\xi \sum_i u_i f_i\,dx = \lambda \sum_i u_i \int \bar{f}_k f_i\,dx$, that is,
$\sum_i u_i \int \bar{f}_k\,\xi f_i\,dx = \lambda \sum_i u_i \int \bar{f}_k f_i\,dx$; on account
of the orthonormality of the f's, this becomes $\sum_i u_i \int \bar{f}_k\,\xi f_i\,dx = \lambda u_k$,
so that

$$\sum_i \xi_{ki} u_i = \lambda u_k. \tag{17}$$

Now, the right side of (17) is the kth component of the vector
$\lambda\mathbf{u}$, while according to 31“ the left side of (17) is the kth component
of the vector $\boldsymbol{\xi}\mathbf{u}$; and since (17) holds for every value of k
we conclude that

$$\boldsymbol{\xi}\mathbf{u} = \lambda\mathbf{u}, \tag{18}$$

that is, that the vector $\mathbf{u}$ is an eigenvector of the matrix $\boldsymbol{\xi}$ belonging
to the eigenvalue $\lambda$ of $\boldsymbol{\xi}$. Consequently an eigenvalue of
the operator $\xi$ is also an eigenvalue of the matrix representing $\xi$.

* The vector $\mathbf{u}$ is associated in function space with the function u,
that is, with the eigenfunction of $\xi$ belonging to the eigenvalue $\lambda$; recall
Exercise 7⁵⁹.

Note that according to (7) the elements of a matrix representing
a given operator depend in general on the choice of the basic f's,
so that the term 'the matrix representing the operator $\xi$' should
be used only when a definite set of f's is referred to. From the
quasi-geometric standpoint, a choice of the f's has to do with a
choice of a coordinate frame in function space; and, as indicated
by the simple examples of §55, the structure of a matrix representing
a mapping may depend on the coordinate frame employed
in constructing the matrix.
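Both remarks, that the matrix depends on the choice of the basic functions and that its eigenvalues are nevertheless those of the operator, can be seen in the two-dimensional example used earlier. The sketch below is illustrative only: functions are represented by their components relative to the real orthonormal pair N sin x and N sin 2x, so that the integral of a product of two functions reduces to the sum of products of components, and the operator -d²/dx² multiplies the two components by 1 and 4. The rotated pair of basic functions is a hypothetical choice made for this illustration.

```python
import math


def apply_xi(c):
    """-d^2/dx^2 acting on c[0]*f1 + c[1]*f2, with f1 = N sin x and f2 = N sin 2x."""
    return (c[0], 4 * c[1])


def dot(a, b):
    """Integral of the product of two real functions, expressed through components."""
    return a[0] * b[0] + a[1] * b[1]


def matrix_in_basis(basis):
    """Element (n, m) is the integral of basis[n] times xi applied to basis[m]."""
    return [[dot(bn, apply_xi(bm)) for bm in basis] for bn in basis]


def eigenvalues_2x2(m):
    """Roots of the secular equation of a 2-by-2 matrix, from its trace and determinant."""
    tr = m[0][0] + m[1][1]
    d = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    root = math.sqrt(tr * tr - 4 * d)
    return ((tr - root) / 2, (tr + root) / 2)


s = 1 / math.sqrt(2)
original = [(1.0, 0.0), (0.0, 1.0)]  # basic functions f1 and f2
rotated = [(s, s), (s, -s)]          # (f1 + f2)/sqrt(2) and (f1 - f2)/sqrt(2)

m_original = matrix_in_basis(original)
m_rotated = matrix_in_basis(rotated)

print(m_original)                  # [[1.0, 0.0], [0.0, 4.0]]
print(eigenvalues_2x2(m_rotated))  # close to (1.0, 4.0)
```

In the rotated frame the matrix is no longer diagonal (its elements are close to 2.5 on the diagonal and -1.5 off it), yet its trace, determinant, and eigenvalues agree with those found in the original frame.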

Exercises 

1. Using 4P and 43^, show that, if the Hermite functions $u_k$ listed in
33® are chosen as the basic f's in such a way that $f_k = u_k$, then the matrices
(denoted by $\mathbf{x}$ and $\mathbf{x}^2$) representing respectively the operators x and $x^2$
have the forms given below. These matrices have infinitely many rows
and columns, and only their upper left-hand corners are shown. Note
that in view of the labeling of the present f's the rows and the columns
of the matrices should be labeled 0, 1, 2, $\cdots$, rather than 1, 2, 3, $\cdots$.

$$\mathbf{x} = \frac{1}{\sqrt{2}} \begin{pmatrix} 0 & \sqrt{1}' & 0 & 0 & 0 \\ \sqrt{1}' & 0 & \sqrt{2}' & 0 & 0 \\ 0 & \sqrt{2}' & 0 & \sqrt{3}' & 0 \\ 0 & 0 & \sqrt{3}' & 0 & \sqrt{4}' \end{pmatrix} \tag{19}$$

$$\mathbf{x}^2 = \frac{1}{2} \begin{pmatrix} 1 & 0 & \sqrt{2}' & 0 & 0 \\ 0 & 3 & 0 & \sqrt{6}' & 0 \\ \sqrt{2}' & 0 & 5 & 0 & \sqrt{12}' \\ 0 & \sqrt{6}' & 0 & 7 & 0 \end{pmatrix} \tag{20}$$

The apostrophe is used in (19) and (20) in the same way as in Exercise 7®,
so that the entry $\sqrt{1}'$ in the top row of (19) stands for …, the
entry $\sqrt{1}'$ in the next row of (19) stands for …, and so on.

2. Verify by matrix multiplication that the matrix (20) is the square
of the matrix (19); in this calculation it is of course necessary to write
out in full the phase factor of each element.





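3. Show that in terms of the basic f's of Exercise 1 the matrices representing
the operators d/dx and $d^2/dx^2$ are

$$\frac{d}{dx} = \frac{1}{\sqrt{2}} \begin{pmatrix} 0 & \sqrt{1}' & 0 & 0 & 0 \\ -\sqrt{1}' & 0 & \sqrt{2}' & 0 & 0 \\ 0 & -\sqrt{2}' & 0 & \sqrt{3}' & 0 \\ 0 & 0 & -\sqrt{3}' & 0 & \sqrt{4}' \end{pmatrix} \tag{21}$$

$$\frac{d^2}{dx^2} = \frac{1}{2} \begin{pmatrix} -1 & 0 & \sqrt{2}' & 0 & 0 \\ 0 & -3 & 0 & \sqrt{6}' & 0 \\ \sqrt{2}' & 0 & -5 & 0 & \sqrt{12}' \\ 0 & \sqrt{6}' & 0 & -7 & 0 \end{pmatrix} \tag{22}$$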


4. Show that, if the basic f's are chosen to be the functions

$$f_n = \begin{cases} 0, & x \le 0 \\ \sqrt{2/\pi}\,\sin nx, & 0 \le x \le \pi \\ 0, & \pi \le x, \end{cases} \tag{23}$$

where n = 1, 2, 3, $\cdots$, then the matrix representing the operator x is

$$\mathbf{x} = \frac{1}{450\pi} \begin{pmatrix} 225\pi^2 & -800 & 0 & \cdots \\ -800 & 225\pi^2 & -864 & \cdots \\ 0 & -864 & 225\pi^2 & \cdots \\ \vdots & \vdots & \vdots & \end{pmatrix} \tag{24}$$

The requisite integrals can be evaluated by reference to …. Note that
this matrix $\mathbf{x}$ is quite different from (19).

5. The operators x and d/dx satisfy the equation (d/dx)x − x(d/dx) = 1;
verify that the matrices (19) and (21) also satisfy it. Why cannot the
matrices (24) and (21) be expected to satisfy it?

6. Show that, if the basic f's are taken to be the normalized eigenfunctions
of a Hermitian differential operator $\xi$ having a nondegenerate
spectrum, then $\xi$ itself is represented by a diagonal matrix. Why cannot
a similar statement be made if $\xi$ is a degenerate Hermitian operator?

7. What matrix represents the operator $-d^2/dx^2 + x^2$ when the basic
f's are the Hermite functions of Exercise 1?

8. Restrict yourself to the case when the basic f's are orthonormal in
the infinite region, and show that a matrix representing a Hermitian operator
is a Hermitian matrix.

9. Verify that the matrices (19) and (20) are Hermitian; remember
that i appears in the phase factors. Find the matrix representing the
operator $-i\,d/dx$ when the basic f's are the Hermite functions, and verify
that this matrix is Hermitian.


61. Concluding Remarks 

The principal aims of this chapter were: first, to outline in
§§56 and 58 the fundamentals of matrix algebra; and second, to
show in §60 how and in what sense differential operators can be
represented by matrices. The main results of §60 are the formulas
4⁶⁰ and 7⁶⁰ and the three theorems following 7⁶⁰. Note that,
apart from the use of some geometric terms, the contents of §§56
and 58 are entirely algebraic and that, as already mentioned,
insofar as the contents of §60 are concerned, we might define the
matrix representing a differential operator $\xi$ by means of 4⁶⁰ and
thus eliminate the quasi-geometric aspects of the discussion.

The reason for our having discussed mappings and introduced
geometric and quasi-geometric notions in spite of the fact that
these are not absolutely essential for the problems that are to
follow is twofold: first, a study of mappings and of their representation
provides a beginner with suitable drill in the manipulation
of matrices; and second, such a study helps to acquire an
intuitive grasp of the purely algebraic
results of this and of the following chapters. The extent to which
the quasi-geometric standpoint is helpful varies of course from
person to person.



Chapter X 

ELEMENTS OF THE HEISENBERG METHOD
62. Heisenberg Matrices 

The Heisenberg theory was the first form of quantum mechanics
to be invented;¹ the formulation of the Schroedinger theory followed
within a few months;² and the fact that the two outwardly
dissimilar theories, suggested to their respective authors by independent
and entirely different considerations, are nevertheless
fundamentally identical from the mathematical standpoint was
proved a few months later.³ In approaching the Heisenberg
theory, we might begin with Heisenberg's original assumptions,
quite independently of all the discussion of quantum mechanics
which we presented so far; or we might do as we did in the case
of the p-method, namely, begin with our general postulates I, II,
and IV, and then, without regard for our work with the Schroedinger
method, add further assumptions, stated now in a form
appropriate to the Heisenberg method; it is, however, most
expedient for us to introduce certain features of the Heisenberg
method in the light of our knowledge of the Schroedinger method.
This approach has the disadvantage of not making it at once
clear how the Heisenberg method could have been invented before
Schroedinger's; but this disadvantage is quite offset by the
simplification of the presentation which this approach permits.
Besides, it is often advantageous in practice to use the Schroedinger
method in some parts and the Heisenberg method in other
parts of the same computation; and consequently it is perhaps
more important for us to emphasize the interrelation between the
two methods, rather than to stress the fact that either of them
can be formulated without reference to the other.

Our discussion will be confined to bound one-dimensional
motion of a particle in a conservative field (in which case the
Schroedinger eigenfunctions of the Hamiltonian are normalizable
and the energy levels are discrete and nondegenerate) and to
dynamical variables that do not involve t explicitly; but many
of our results hold more generally.

¹ W. Heisenberg, Zeits. f. Physik, 33, 879 (1925).

² E. Schroedinger, Ann. der Physik, 79, 361, 489, 734; 80, 437; 81, 109
(January to June, 1926).

³ E. Schroedinger, Ann. der Physik, 79, 734 (1926); C. Eckart, Phys.
Rev., 28, 711 (1926).

The operators associated with dynamical variables in the Heisenberg
method are matrices, and the procedure for computing the
Heisenberg matrix $\alpha$ which is associated with the dynamical
variable $\alpha$ pertaining to a dynamical system whose Hamiltonian
is $H$ consists of the following steps:

(a) We identify the Schroedinger operator $H$ associated with
the Hamiltonian of the dynamical system under consideration,
and compute the normalized time-dependent Schroedinger
eigenfunctions of $H$, say

\[ \psi_1, \psi_2, \psi_3, \ldots, \tag{1} \]

belonging to the energy levels $E_1, E_2, E_3, \ldots$;

(b) we identify the Schroedinger operator $\alpha$ associated with the
dynamical variable $\alpha$ under consideration; and

(c) using the functions (1), we compute the matrix representing
the Schroedinger operator $\alpha$; this matrix, that is, the matrix
with elements

\[ \alpha_{nm} = \int \bar{\psi}_n \alpha \psi_m \, dx, \tag{2} \]

is the desired Heisenberg operator.

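The steps just outlined may be summarized in the following
statement, which we adopt as the definition of Heisenberg
matrices: the Heisenberg matrix associated with a dynamical
variable $\alpha$ pertaining to a system with the Hamiltonian $H$
is the matrix representing the Schroedinger operator $\alpha$ in terms of
the normalized time-dependent Schroedinger eigenfunctions
of $H$.

It follows from the definition stated in italics just above
that any algebraic relation that holds between two or more
Schroedinger operators holds also between the corresponding
Heisenberg matrices. In particular, the Schroedinger operators
satisfy the commutation rule required by the quantum condition,
and hence the Heisenberg operators also conform to this condition.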

It also follows from this definition and from the result of
Exercise 8®** that Heisenberg matrices are Hermitian, and
consequently their eigenvalues are real.* Hence the use of Heisenberg
matrices as operators to be associated with dynamical variables
is compatible with the operator assumption I of §13.

* The reality of the eigenvalues of a Hermitian matrix, taken for granted
in §58, will be proved in §69.

Examples. To illustrate Heisenberg matrices, we shall now
construct some matrices pertaining to a linear harmonic oscillator.
In this case the Schroedinger operator associated with the
Hamiltonian is

\[ H = -\frac{1}{\kappa}\frac{\partial^2}{\partial x^2} + \tfrac{1}{2}kx^2, \tag{3} \]

where $\kappa = 2m/\hbar^2$, and the normalized time-dependent
Schroedinger eigenfunctions of the Hamiltonian are (see 7^®)

\[ \psi_n = N_n H_n(x/a)\, e^{-x^2/2a^2}\, e^{-iE_n t/\hbar}\, e^{i\gamma_n}, \qquad n = 0, 1, 2, \ldots, \tag{4} \]

where

\[ E_n = (n + \tfrac{1}{2})h\nu_c, \]

and $N_n = (2^n n!\, a\sqrt{\pi})^{-1/2}$, where $H_n$ is the Hermite polynomial 37*,
and where the $\gamma$'s are arbitrary real constants, independent of
both $x$ and $t$; we shall also use the abbreviations

\[ b = \hbar/a, \qquad a^2\sqrt{km} = \hbar. \tag{6} \]

According to the definition (2), the Heisenberg matrix
associated with the coordinate $x$, when the system under consideration
is the linear harmonic oscillator, has the elements

\[ x_{nm} = \int \bar{\psi}_n x \psi_m \, dx, \tag{7} \]

where the $\psi$'s are given by (4), and $x$ in the integrand is the
Schroedinger operator associated with the coordinate $x$; the
subscript $m$ in (7), which takes on the values 0, 1, 2, ..., is of
course not to be confused with the mass of the particle. The
integral in (7) is essentially of the form 41®, and we get

\[ x_{nm} = \begin{cases}
a\sqrt{\tfrac{1}{2}n}\; e^{-i(\gamma_n - \gamma_{n-1})}\, e^{i(E_n - E_{n-1})t/\hbar} & \text{if } m = n - 1 \\
a\sqrt{\tfrac{1}{2}(n+1)}\; e^{-i(\gamma_n - \gamma_{n+1})}\, e^{i(E_n - E_{n+1})t/\hbar} & \text{if } m = n + 1 \\
0 & \text{otherwise.}
\end{cases} \tag{8} \]

Using the apostrophe, as before, for the phase factor
$\exp[-i(\gamma_n - \gamma_m)]$ and the prime for the time factor
$\exp[i(E_n - E_m)t/\hbar]$, we can abbreviate (8) to

\[ x_{nm} = \begin{cases}
a\sqrt{\tfrac{1}{2}n}\,'' & \text{if } m = n - 1 \\
a\sqrt{\tfrac{1}{2}(n+1)}\,'' & \text{if } m = n + 1 \\
0 & \text{otherwise.}
\end{cases} \tag{9} \]

The matrix $x$ thus has the form

\[ x = \frac{a}{\sqrt{2}} \begin{pmatrix}
0 & \sqrt{1}\,'' & 0 & 0 & \cdots \\
\sqrt{1}\,'' & 0 & \sqrt{2}\,'' & 0 & \cdots \\
0 & \sqrt{2}\,'' & 0 & \sqrt{3}\,'' & \cdots \\
0 & 0 & \sqrt{3}\,'' & 0 & \cdots \\
\cdots & \cdots & \cdots & \cdots & \cdots
\end{pmatrix}. \tag{10} \]

Note that the rows and columns in (10) are to be labeled 0, 1, 2, ...,
rather than 1, 2, 3, ....
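The elements (8) can be spot-checked by carrying out the integral (7) numerically. The following is only a minimal sketch under stated assumptions: it takes t = 0 and all phases equal to zero, works in units with a = 1, and uses a plain Riemann sum; the names `psi`, `x_element` and `x_block` are illustrative and do not come from the text.

```python
import math
import numpy as np
from numpy.polynomial.hermite import hermval

def psi(n, x, a=1.0):
    """Normalized oscillator eigenfunction at t = 0 with phase gamma_n = 0 (assumed)."""
    coeffs = np.zeros(n + 1)
    coeffs[n] = 1.0  # selects the (physicists') Hermite polynomial H_n
    norm = 1.0 / math.sqrt(2.0**n * math.factorial(n) * a * math.sqrt(math.pi))
    return norm * hermval(x / a, coeffs) * np.exp(-x**2 / (2.0 * a**2))

def x_element(n, m, a=1.0):
    """x_nm = integral of psi_n * x * psi_m dx, approximated by a Riemann sum."""
    grid = np.linspace(-12.0 * a, 12.0 * a, 20001)
    dx = grid[1] - grid[0]
    return float(np.sum(psi(n, grid, a) * grid * psi(m, grid, a)) * dx)

# Upper left-hand corner of the matrix x, rows and columns labeled 0, 1, 2, 3.
x_block = np.array([[x_element(n, m) for m in range(4)] for n in range(4)])
```

With these assumptions the only nonvanishing elements of `x_block` should lie next to the diagonal, with `x_block[n, n + 1]` close to the value a·sqrt((n + 1)/2) given in (8).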

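Similarly, the Heisenberg matrix $p$ associated with the
momentum of the linear harmonic oscillator has the elements

\[ p_{nm} = \int \bar{\psi}_n p \psi_m \, dx, \tag{11} \]

where the $\psi$'s are given by (4) and $p$ in the integrand is the
Schroedinger operator $p = -i\hbar\,\partial/\partial x$ associated with the
momentum, so that, more explicitly,

\[ p_{nm} = -i\hbar \int \bar{\psi}_n \frac{\partial \psi_m}{\partial x} \, dx. \tag{12} \]

Comparison of (12) with 42® shows that

\[ p_{nm} = \begin{cases}
ib\sqrt{\tfrac{1}{2}n}\; e^{-i(\gamma_n - \gamma_{n-1})}\, e^{i(E_n - E_{n-1})t/\hbar} & \text{if } m = n - 1 \\
-ib\sqrt{\tfrac{1}{2}(n+1)}\; e^{-i(\gamma_n - \gamma_{n+1})}\, e^{i(E_n - E_{n+1})t/\hbar} & \text{if } m = n + 1 \\
0 & \text{otherwise,}
\end{cases} \tag{13} \]

that is, that

\[ p_{nm} = \begin{cases}
ib\sqrt{\tfrac{1}{2}n}\,'' & \text{if } m = n - 1 \\
-ib\sqrt{\tfrac{1}{2}(n+1)}\,'' & \text{if } m = n + 1 \\
0 & \text{otherwise.}
\end{cases} \tag{14} \]

The matrix $p$ thus has the form

\[ p = \frac{ib}{\sqrt{2}} \begin{pmatrix}
0 & -\sqrt{1}\,'' & 0 & 0 & 0 & \cdots \\
\sqrt{1}\,'' & 0 & -\sqrt{2}\,'' & 0 & 0 & \cdots \\
0 & \sqrt{2}\,'' & 0 & -\sqrt{3}\,'' & 0 & \cdots \\
0 & 0 & \sqrt{3}\,'' & 0 & -\sqrt{4}\,'' & \cdots \\
\cdots & \cdots & \cdots & \cdots & \cdots & \cdots
\end{pmatrix}, \tag{15} \]

where the labeling of rows and columns, as in the case of all
Heisenberg matrices pertaining to the oscillator, is the same
as in (10).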
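The matrices x and p for the oscillator can be written down numerically and tested against the quantum condition xp − px = iħ. This is a hedged sketch, not the book's procedure: it truncates the infinite matrices to N rows and columns, sets t = 0 and all phases to zero, and uses units with ħ = a = 1 (so that b = ħ/a = 1); the helper names are hypothetical.

```python
import numpy as np

HBAR = 1.0
A = 1.0            # length constant a (assumed units)
B = HBAR / A       # b = hbar / a
N = 8              # truncation size; the true matrices are infinite

def x_matrix(size, a=A):
    """Truncated coordinate matrix with zero phases at t = 0."""
    x = np.zeros((size, size), dtype=complex)
    for n in range(size - 1):
        x[n, n + 1] = a * np.sqrt((n + 1) / 2.0)
        x[n + 1, n] = a * np.sqrt((n + 1) / 2.0)
    return x

def p_matrix(size, b=B):
    """Truncated momentum matrix with zero phases at t = 0."""
    p = np.zeros((size, size), dtype=complex)
    for n in range(size - 1):
        p[n, n + 1] = -1j * b * np.sqrt((n + 1) / 2.0)
        p[n + 1, n] = 1j * b * np.sqrt((n + 1) / 2.0)
    return p

x = x_matrix(N)
p = p_matrix(N)
commutator = x @ p - p @ x
```

Both matrices come out Hermitian, and the diagonal of `commutator` equals iħ everywhere except in the last position, where the truncation to a finite matrix spoils the result; this edge effect is an artifact of the sketch, not a property of the infinite matrices.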

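Again, the Heisenberg matrix $H$ associated with the energy of
the linear harmonic oscillator has the elements

\[ H_{nm} = \int \bar{\psi}_n H \psi_m \, dx, \tag{16} \]

where the $\psi$'s are given by (4), and $H$ in the integrand is the
Schroedinger operator (3) associated with the Hamiltonian of
the oscillator. Since $\psi_m$ in (16) is the eigenfunction of $H$
belonging to the eigenvalue $E_m$, we have $H\psi_m = E_m\psi_m$,
so that in view of the orthonormality of the $\psi$'s

\[ H_{nm} = E_m \int \bar{\psi}_n \psi_m \, dx = E_m \delta_{nm}. \]

Now, $E_n = (n + \tfrac{1}{2})h\nu_c$, and hence

\[ H_{nm} = \begin{cases}
(n + \tfrac{1}{2})h\nu_c & \text{if } m = n \\
0 & \text{if } m \neq n.
\end{cases} \tag{17} \]

\[ H = \begin{pmatrix}
\tfrac{1}{2}h\nu_c & 0 & 0 & 0 & \cdots \\
0 & \tfrac{3}{2}h\nu_c & 0 & 0 & \cdots \\
0 & 0 & \tfrac{5}{2}h\nu_c & 0 & \cdots \\
0 & 0 & 0 & \tfrac{7}{2}h\nu_c & \cdots \\
\cdots & \cdots & \cdots & \cdots & \cdots
\end{pmatrix}. \tag{18} \]

Thus $H$ is a diagonal matrix whose diagonal elements are the
energy levels of the oscillator.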
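The same diagonal matrix should also result from matrix algebra alone, using H = p²/2m + ½kx². The sketch below is illustrative only and assumes units with ħ = m = k = 1, zero phases, t = 0, and a truncation to N rows and columns; because of the truncation the last diagonal element is not reliable and is left out of any comparison.

```python
import numpy as np

HBAR, M, K = 1.0, 1.0, 1.0            # assumed units
OMEGA = np.sqrt(K / M)                # 2*pi times the classical frequency
A = np.sqrt(HBAR / np.sqrt(K * M))    # a^2 = hbar / sqrt(k m)
B = HBAR / A
N = 10                                # truncation size

s = np.sqrt(np.arange(1, N) / 2.0)    # sqrt(1/2), sqrt(2/2), sqrt(3/2), ...
x = A * (np.diag(s, 1) + np.diag(s, -1))
p = 1j * B * (np.diag(s, -1) - np.diag(s, 1))

# Energy matrix built from the matrices x and p by matrix algebra.
H = p @ p / (2.0 * M) + 0.5 * K * x @ x
levels = np.real(np.diag(H))
```

Apart from the last entry, `levels` should reproduce (n + ½)ħω, which is the same as (n + ½)hν_c, and the off-diagonal elements of `H` should vanish.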





If the Heisenberg matrix associated with a dynamical variable $\alpha$
is available, then the matrix pertaining to the same dynamical
system and associated with a dynamical variable that is a function
of $\alpha$ can be computed by matrix algebra without invoking the
defining equation (2). For example, if the matrix $x$ for the
oscillator, that is, the formula (9), is available, then the matrix
$x^2$ for the oscillator can be computed by squaring the matrix $x$.
In fact, we have

\[ (x^2)_{nm} = \sum_i x_{ni} x_{im}, \tag{19} \]

where, in our case, $i = 0, 1, 2, \ldots$; according to (9), $x_{ni}$ vanishes
unless $i = n - 1$ or $i = n + 1$, so that, for a fixed $n$, two terms,
at most, on the right of (19) are different from zero, and (19)
reduces to

\[ (x^2)_{nm} = x_{n,n-1} x_{n-1,m} + x_{n,n+1} x_{n+1,m}; \tag{20} \]

now, again according to (9), $x_{n-1,m}$ vanishes unless $m = n - 2$
or $m = n$, while $x_{n+1,m}$ vanishes unless $m = n$ or $m = n + 2$;
hence the only elements of $x^2$ which do not necessarily vanish are

\[ \begin{aligned}
(x^2)_{n,n-2} &= x_{n,n-1} x_{n-1,n-2} \\
(x^2)_{nn} &= x_{n,n-1} x_{n-1,n} + x_{n,n+1} x_{n+1,n} \\
(x^2)_{n,n+2} &= x_{n,n+1} x_{n+1,n+2}.
\end{aligned} \tag{21} \]

Substituting the appropriate elements of $x$ from (9) into Equations
(21) with due attention to the phase and time factors, we finally get

\[ (x^2)_{nm} = \begin{cases}
\tfrac{1}{2}a^2\sqrt{n(n-1)}\,'' & \text{if } m = n - 2 \\
a^2(n + \tfrac{1}{2}) & \text{if } m = n \\
\tfrac{1}{2}a^2\sqrt{(n+1)(n+2)}\,'' & \text{if } m = n + 2 \\
0 & \text{otherwise.}
\end{cases} \tag{22} \]
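The squaring of the matrix x can be mimicked numerically and compared with (22). This is a sketch under assumptions not made in the text: a = 1, zero phases, t = 0, and a truncation to N rows and columns, so that the very last diagonal element of the product is affected by the cut-off. The function name `x2_formula` is illustrative.

```python
import numpy as np

A = 1.0    # length constant a (assumed units)
N = 12     # truncation size

s = np.sqrt(np.arange(1, N) / 2.0)
x = A * (np.diag(s, 1) + np.diag(s, -1))
x_squared = x @ x                     # matrix product, as in (19)

def x2_formula(n, m, a=A):
    """Element (n, m) of x^2 according to (22), phase and time factors set to 1."""
    if m == n - 2:
        return 0.5 * a**2 * np.sqrt(n * (n - 1))
    if m == n:
        return a**2 * (n + 0.5)
    if m == n + 2:
        return 0.5 * a**2 * np.sqrt((n + 1) * (n + 2))
    return 0.0

expected = np.array([[x2_formula(n, m) for m in range(N)] for n in range(N)])
```

Comparing `x_squared[:N-1, :N-1]` with `expected[:N-1, :N-1]` avoids the single element spoiled by the truncation.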


Incidentally, in the notation due to Dirac, the matrix element
denoted above by $\alpha_{nm}$ is denoted instead by $(n \mid \alpha \mid m)$; thus the
nmth element of the matrix $x$ is denoted by $(n \mid x \mid m)$, the nmth
element of the matrix $p^2$ by $(n \mid p^2 \mid m)$, and so forth. The symbol
$(n \mid \alpha \mid m)$ fits in with the symbols $(x \mid n)$ and $(m \mid x)$ for $\psi_n$ and
$\bar{\psi}_m$ (see the end of §27). In this notation the definition
of a Heisenberg matrix element becomes

\[ (n \mid \alpha \mid m) = \int (n \mid x)\, \alpha\, (x \mid m) \, dx, \tag{23} \]

where $\alpha$ in the integrand is the Schroedinger operator associated
with the dynamical variable $\alpha$, and where $(x \mid m)$ and $(n \mid x)$ are
normalized.


Exercises 


1. Verify (22) by means of (2).

2. Compute the matrix (17) by matrix algebra from the matrices (9)
and (14), using the equation $H = p^2/2m + \tfrac{1}{2}kx^2$.

3. Verify by matrix algebra that the matrices (9) and (14) satisfy the
equation $xp - px = i\hbar$.

4. Compute the matrix (14) by matrix algebra from the matrices (9)
and (17), using the relation $p = m\dot{x}$ and the equation of motion 27*^.

5. Set $\gamma_n$ in (4) equal to $\gamma_0 - n(\gamma + \tfrac{1}{2}\pi)$, where $\gamma_0$ and $\gamma$ are arbitrary
real constants, and show that for large values of $n$ the matrix elements (8)
then become

\[ x_{n,n-r} = \begin{cases}
-i\sqrt{E_n/2k}\; e^{i\gamma}\,' & \text{if } r = 1 \\
i\sqrt{E_n/2k}\; e^{-i\gamma}\,' & \text{if } r = -1 \\
0 & \text{otherwise.}
\end{cases} \tag{24} \]

Show also that with the same choice of phases the elements of the matrix $p^2$
pertaining to the oscillator become, for large values of $n$,

\[ (p^2)_{n,n-r} = \begin{cases}
\tfrac{1}{2}mE_n e^{2i\gamma}\,' & \text{if } r = 2 \\
mE_n & \text{if } r = 0 \\
\tfrac{1}{2}mE_n e^{-2i\gamma}\,' & \text{if } r = -2 \\
0 & \text{otherwise.}
\end{cases} \tag{25} \]

6. Compute a few elements in the upper left-hand corner of the
Heisenberg matrix associated with the coordinate $x$ of a particle moving in a
one-dimensional box extending from $x = 0$ to $x = l$, and note that this
matrix resembles 24®® and is quite different from (10).


63. Heisenberg Matrices (Continued) 

We shall now enumerate certain further properties of the
Heisenberg matrices. In deriving these properties we shall use the
definition 2⁶²; but note that these properties themselves, as well
as some of those that were pointed out in the preceding section,
can be described without reference to the Schroedinger method.
As before, we restrict ourselves to bound one-dimensional motion.

A. As we have seen in §14, the structure of the Schroedinger
operator associated with a specific dynamical variable $\alpha(x, p)$ is
quite independent of the dynamical system in connection with
which the operator is to be used; for example, the respective
Schroedinger operators associated with the coordinate $x$ and the
momentum $p$ of a particle are always $x$ and $-i\hbar\,\partial/\partial x$, whatever
the dynamical system under consideration may be. On the other
hand, the structure of the Heisenberg matrix associated with a
dynamical variable $\alpha(x, p)$ depends on the dynamical system under
consideration. This property of the Heisenberg matrices,
illustrated in Exercise 6⁶², follows from the fact that the $\psi$'s in 2⁶²
differ from system to system.
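The dependence on the dynamical system can be seen by computing the coordinate matrix for a particle in a box instead of an oscillator. The sketch below rests on assumptions that are not in the text: a box of unit length on 0 ≤ x ≤ 1, eigenfunctions sqrt(2/l)·sin(nπx/l) labeled n = 1, 2, 3, ..., zero phases, t = 0, and a trapezoidal rule written out by hand.

```python
import numpy as np

def box_x_element(n, m, length=1.0, points=20001):
    """x_nm for a box on [0, length]; states labeled n, m = 1, 2, 3, ... (t = 0, zero phases)."""
    grid = np.linspace(0.0, length, points)
    dx = grid[1] - grid[0]
    psi_n = np.sqrt(2.0 / length) * np.sin(n * np.pi * grid / length)
    psi_m = np.sqrt(2.0 / length) * np.sin(m * np.pi * grid / length)
    integrand = psi_n * grid * psi_m
    # Trapezoidal rule: full sum minus half of the two end values.
    return float(dx * (integrand.sum() - 0.5 * (integrand[0] + integrand[-1])))

# Upper left-hand corner of the coordinate matrix for the box.
box_x = np.array([[box_x_element(n, m) for m in range(1, 5)] for n in range(1, 5)])
```

Unlike the oscillator matrix (10) of §62, `box_x` has nonvanishing diagonal elements (all equal to half the box length) and nonvanishing elements well away from the diagonal, for instance between the first and the fourth state.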

B. Taking the phase and time factors of the $\psi$'s in 2⁶² outside
the integral sign, we find that the nmth element of a Heisenberg
matrix $\alpha$ has the form

\[ \alpha_{nm} = a_{nm}\, e^{-i(\gamma_n - \gamma_m)}\, e^{i(E_n - E_m)t/\hbar}, \tag{1} \]

where $a_{nm}$ is a numerical constant,* the $\gamma$'s are the arbitrary
phases of the $\psi$'s, and $E_n$ and $E_m$ are the respective energies of
the nth and the mth energy states of the system under
consideration. Thus the nmth element of every Heisenberg matrix pertaining
to a given dynamical system has the same phase factor
$e^{-i(\gamma_n - \gamma_m)}$; the phase factors of the off-diagonal elements are
interrelated, but are not fixed unless the arbitrary phases of the
Schroedinger eigenfunctions of $H$ are fixed by a suitable convention
(we speak of this arbitrariness of the phase factors as their
standard arbitrariness); the phase factors of the diagonal elements
are equal to 1, so that these elements are free from any
arbitrariness. Further, the nmth element of every Heisenberg matrix
pertaining to a given dynamical system depends on t through the same
time factor $e^{i(E_n - E_m)t/\hbar}$; this factor is periodic in $t$ with
frequency $(E_n - E_m)/h$, which we denote by $\nu_{nm}$:

\[ \nu_{nm} = (E_n - E_m)/h; \tag{2} \]

comparison with 4^^ now shows that the frequency of a
(nonvanishing) off-diagonal element $\alpha_{nm}$ equals the frequency of the
light that would be emitted in the transition of the system from
the nth to the mth energy state; in particular, the diagonal
elements of a Heisenberg matrix are independent of $t$.

It is the standard form of the phase and time factor of $\alpha_{nm}$
that permits us to abbreviate them without confusion by an
apostrophe and a prime, respectively, and to write (1) as
$\alpha_{nm} = a_{nm}\,''$; in quantum-mechanical literature it is customary not to
indicate these factors at all, but to have the reader keep them
in mind.

* Remember that the dynamical variables under consideration do not
involve t explicitly.

C. The Heisenberg matrix $H$ associated with the Hamiltonian of
the system under consideration is a diagonal matrix whose diagonal
elements are the energy levels of the system:

\[ H_{nm} = \begin{cases}
E_n & \text{if } m = n \\
0 & \text{if } m \neq n.
\end{cases} \tag{3} \]

The proof is the same as that leading to 17⁶² in the special case
of the oscillator.

D. Comparing the definition 2⁶² with the Schroedinger
expectation formula 2^^ and remembering that the $\psi$'s in 2⁶² are
normalized, we find that

\[ \alpha_{nn} = \mathrm{av}_n\, \alpha, \tag{4} \]

that is, that the nth diagonal element of the Heisenberg matrix $\alpha$
is the expected average of the dynamical variable $\alpha$ for the nth quantum
state of the system under consideration.

E. Using 2⁶², we can write 6®^ as

\[ \left(\text{probability per unit time of the transition } n \to m\right) = \frac{64\pi^4 e^2 \nu_{nm}^3}{3hc^3}\, |x_{nm}|^2, \tag{5} \]

so that the off-diagonal elements of the Heisenberg matrix associated
with x are intimately related to the transition probabilities of the
system under consideration. The off-diagonal elements of
Heisenberg matrices associated with dynamical variables other than
coordinates do not have as direct a physical interpretation. The
elements of all Heisenberg matrices are, however, asymptotically
connected to certain classical quantities; this connection will be
illustrated later in this section.

F. A Heisenberg matrix $\alpha$ depends in general on $t$; denoting it
for the moment by $\alpha(t)$ and adapting to the case of matrices the
standard definition of a time derivative, we define the time
derivative of $\alpha$ as the matrix

\[ \dot{\alpha} = \lim_{\Delta t \to 0} \frac{\alpha(t + \Delta t) - \alpha(t)}{\Delta t}. \tag{6} \]

It follows that

\[ \dot{\alpha}_{nm} = \lim_{\Delta t \to 0} \frac{\alpha_{nm}(t + \Delta t) - \alpha_{nm}(t)}{\Delta t} = \frac{d}{dt}\alpha_{nm}, \tag{7} \]

where $\dot{\alpha}_{nm}$ is the nmth element of the matrix $\dot{\alpha}$ and $d\alpha_{nm}/dt$ is
the time derivative of $\alpha_{nm}$; accordingly, the time derivative of a
Heisenberg matrix $\alpha$ is the matrix whose elements are the time
derivatives of the corresponding elements of $\alpha$. In view of (1) and
(2), Equation (7) becomes, in the case of Heisenberg matrices,

\[ \dot{\alpha}_{nm} = \frac{i(E_n - E_m)}{\hbar}\,\alpha_{nm} = 2\pi i \nu_{nm} \alpha_{nm}. \tag{8} \]

G. Next we show that the time derivative, $\dot{\alpha}$, of a Heisenberg
matrix $\alpha$ is $i\hbar^{-1}$ times the commutator of the Heisenberg matrices
$H$ and $\alpha$, where $H$ is the Heisenberg matrix associated with the
Hamiltonian of the system under consideration. Indeed, taking
the nmth elements of both sides of the matrix equation

\[ \dot{\alpha} = \frac{i}{\hbar}(H\alpha - \alpha H), \tag{9} \]

we get

\[ \dot{\alpha}_{nm} = \frac{i}{\hbar} \sum_k (H_{nk}\alpha_{km} - \alpha_{nk}H_{km}); \tag{10} \]

in view of (3) this reduces to

\[ \dot{\alpha}_{nm} = \frac{i}{\hbar}(H_{nn}\alpha_{nm} - \alpha_{nm}H_{mm}) = \frac{i(E_n - E_m)}{\hbar}\,\alpha_{nm}, \tag{11} \]

and a comparison with (8) completes the verification of (9).
Now, according to the equation of motion 27^, the right side of
(9) is the operator to be associated with the time derivative of
the dynamical variable $\alpha$, and hence the Heisenberg matrix
associated with the time derivative of a dynamical variable $\alpha$ is the time
derivative of the Heisenberg matrix associated with the dynamical
variable $\alpha$.
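The agreement between the element-by-element time derivative and the commutator with H can be illustrated numerically for the oscillator coordinate. The sketch assumes units with ħ = ω = a = 1, zero phases, and a truncation to N rows and columns; the derivative is approximated by a central difference, and all names are illustrative.

```python
import numpy as np

HBAR, OMEGA, A = 1.0, 1.0, 1.0        # assumed units
N = 8                                 # truncation size

energies = (np.arange(N) + 0.5) * HBAR * OMEGA
H = np.diag(energies)                 # diagonal energy matrix
s = np.sqrt(np.arange(1, N) / 2.0)
x_amplitudes = A * (np.diag(s, 1) + np.diag(s, -1))

def x_of_t(t):
    """x_nm(t) = a_nm * exp(i (E_n - E_m) t / hbar), with all phases set to zero."""
    time_factors = np.exp(1j * (energies[:, None] - energies[None, :]) * t / HBAR)
    return x_amplitudes * time_factors

t0, dt = 0.3, 1e-6
# Time derivative taken element by element, by a central difference.
finite_difference = (x_of_t(t0 + dt) - x_of_t(t0 - dt)) / (2.0 * dt)
# Time derivative obtained as (i / hbar) times the commutator of H and x.
commutator_form = (1j / HBAR) * (H @ x_of_t(t0) - x_of_t(t0) @ H)
```

Within the accuracy of the finite difference the two matrices coincide; since H is diagonal, the truncation does not disturb this particular check.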

H. We shall now illustrate an asymptotic connection between
the elements of Heisenberg matrices and the coefficients of the
classical Fourier expansions of the corresponding dynamical
variables. When the classical motion of a one-dimensional system is
periodic with a period $T$, then every dynamical variable $\alpha$
pertaining to the system can be expressed classically as a Fourier
series

\[ \alpha = \sum_r \alpha_r e^{ir\omega t}, \qquad r = \ldots, -2, -1, 0, 1, 2, \ldots, \tag{12} \]

where $\omega = 2\pi/T$ is $2\pi$ times the fundamental frequency of the
motion, and where $\alpha_r$ is the (complex) amplitude of the rth
harmonic of $\alpha$; note that when complex exponentials rather than
trigonometric functions are used in the Fourier series the
harmonics are numbered ..., −2, −1, 0, 1, 2, .... For example,
the classical Fourier series for the coordinate of an oscillator is

\[ x = \left(i\sqrt{E/2k}\; e^{-i\gamma}\right) e^{-i\omega t} + \left(-i\sqrt{E/2k}\; e^{i\gamma}\right) e^{i\omega t}; \tag{13} \]

in this case the amplitude of the harmonic numbered −1 is
$i\sqrt{E/2k}\,e^{-i\gamma}$, that of the harmonic numbered 1 is $-i\sqrt{E/2k}\,e^{i\gamma}$,
and the amplitudes of all other harmonics vanish, so that the
various amplitudes can be listed as follows:

Table 14

\[ \begin{array}{l|ccccccccc}
r & \cdots & -3 & -2 & -1 & 0 & 1 & 2 & 3 & \cdots \\ \hline
\text{Amplitude of the classical } r\text{th harmonic of } x & \cdots & 0 & 0 & i\sqrt{E/2k}\,e^{-i\gamma} & 0 & -i\sqrt{E/2k}\,e^{i\gamma} & 0 & 0 & \cdots
\end{array} \]

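Let us turn next to the Heisenberg matrix representing the
coordinate of the oscillator and consider those of its elements $x_{nm}$
for which both $n$ and $m$ are large. These elements are listed in
24⁶², so that, disregarding the standard time factors, we get:

Table 15

\[ \begin{array}{l|ccccccccc}
r & \cdots & -3 & -2 & -1 & 0 & 1 & 2 & 3 & \cdots \\ \hline
x_{n,n-r} \text{ for large } n \text{ and apart from time factor} & \cdots & 0 & 0 & i\sqrt{E_n/2k}\,e^{-i\gamma} & 0 & -i\sqrt{E_n/2k}\,e^{i\gamma} & 0 & 0 & \cdots
\end{array} \]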

Comparison of the two tables now shows that, when $n$ is large,
the matrix element $x_{n,n-r}$ is, apart from its time factor, precisely
the classical amplitude of the rth harmonic of $x$, except that the
$E$ of Table 14 is replaced in Table 15 by $E_n$.
Similarly, for the square of the momentum of the oscillator,
we get from 22^® and from 25⁶²:

Table 16

\[ \begin{array}{l|ccccccccc}
r & \cdots & -3 & -2 & -1 & 0 & 1 & 2 & 3 & \cdots \\ \hline
\text{Amplitude of the classical } r\text{th harmonic of } p^2 & \cdots & 0 & \tfrac{1}{2}mE\,e^{-2i\gamma} & 0 & mE & 0 & \tfrac{1}{2}mE\,e^{2i\gamma} & 0 & \cdots \\
(p^2)_{n,n-r} \text{ for large } n \text{ and apart from time factor} & \cdots & 0 & \tfrac{1}{2}mE_n\,e^{-2i\gamma} & 0 & mE_n & 0 & \tfrac{1}{2}mE_n\,e^{2i\gamma} & 0 & \cdots
\end{array} \]

The asymptotic agreement between matrix elements and classical
Fourier amplitudes is again apparent.

The two examples given above illustrate the theorem that we
shall call the correspondence rule for the Heisenberg matrices, a
theorem stating essentially that for large values of n and small
values of r the matrix element $\alpha_{n,n-r}$ of the Heisenberg matrix
associated with a dynamical variable $\alpha$ is simply related to the rth classical
harmonic of $\alpha$. The proof of this theorem will be omitted, since
it employs advanced classical mechanics; nor shall we attempt to
state the theorem more explicitly than we did above. Note that
the precise agreement of, say, Tables 14 and 15 is due in part to
the special choice of the $\gamma_n$'s adopted in Exercise 5⁶².

The correspondence rule is in accord with the Bohr correspondence
principle, formulated before the advent of quantum mechanics,
which postulates asymptotic connections between atomic mechanics
and classical mechanics in the region of the higher energies.
This principle played a fundamental rôle in the considerations
that led Heisenberg to the formulation of matrix mechanics.
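The asymptotic character of the agreement can be made concrete for the oscillator coordinate: the squared modulus of $x_{n,n-1}$ is ½a²n, while the squared modulus of the classical amplitude at energy $E_n$ is $E_n/2k$. A small sketch, assuming units with ħ = m = k = 1 (the function name `ratio` is illustrative), compares the two for growing n.

```python
import numpy as np

HBAR, M, K = 1.0, 1.0, 1.0            # assumed units
OMEGA = np.sqrt(K / M)
A_SQUARED = HBAR / np.sqrt(K * M)     # a^2 = hbar / sqrt(k m)

def ratio(n):
    """|x_{n,n-1}|^2 divided by the classical squared amplitude E_n / (2k)."""
    matrix_value = A_SQUARED * n / 2.0
    energy = (n + 0.5) * HBAR * OMEGA
    return matrix_value / (energy / (2.0 * K))

ratios = [ratio(n) for n in (1, 10, 100, 1000)]
```

The ratio works out to n/(n + ½), so it differs noticeably from 1 for the lowest states and approaches 1 as n grows, which is the sense in which the agreement of Tables 14 and 15 holds only for large n.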

Exercises

1. [...] Heisenberg matrices pertaining to a [...] must satisfy the
equations [...]. Verify that the matrices 9⁶² and 14⁶² satisfy these
relations.

2. Compute the numerical values of the elements in the upper left-hand
corner of the Heisenberg matrix associated with the energy of a
particle in a box. Note that this matrix is different from 18⁶².

3. [...]

4. Show that in the case of the harmonic oscillator the time
factor of a Heisenberg matrix element $\alpha_{n,n-r}$ is the same as that of the
rth harmonic of the classical dynamical variable $\alpha$; to what special feature
of the energy spectrum of the oscillator is this coincidence due?

5. Show that, if the motion of a system is bound, then the diagonal
elements of the Heisenberg matrix p pertaining to this system are all zeros.

6. Recall the criterion and show that, if the energy spectrum of a
system is nondegenerate, the Heisenberg matrix associated with a constant of
motion of the system is diagonal; see Exercise 7 for an example. Then
use the rule for time differentiation of Heisenberg matrices to verify that
the matrix associated with the time derivative of a constant of motion is
the matrix zero.

7. Compute the Heisenberg matrix representing the dynamical variable
12 of §47 when the dynamical system under consideration is a linear
harmonic oscillator. For what class of one-dimensional dynamical systems
is the Heisenberg matrix 12 the same as it is for a linear harmonic oscillator?

8. Show that the constants of motion of a bound one-dimensional system
all commute with one another.

9. Show that, if a system has noncommuting constants of motion, then
its energy spectrum is degenerate. (On account of the result of Exercise 8,
this theorem is of no interest in connection with bound one-dimensional
systems.)

10. Assume the matrix H to be degenerate (a case that does not arise
in bound one-dimensional motion), recall the criterion and show that,
although the matrix associated with a constant of motion need not then
be diagonal, its time derivative, defined by (8), is nevertheless the
matrix zero.

11. Show that the definition 23⁶² of the Heisenberg matrix element
$(n \mid \alpha \mid m)$, that is, $\alpha_{nm}$, is equivalent to the following definition:

\[ (n \mid \alpha \mid m) = \int (n \mid p)\, \alpha\, (p \mid m) \, dp, \tag{14} \]

where $\alpha$ in the integrand is the operator associated in the p-scheme with
the dynamical variable $\alpha$, $(p \mid k)$ is the normalized $\psi$-function of the
p-scheme describing the kth energy state of the system, and $(k \mid p)$ is,
as usual, the complex conjugate of $(p \mid k)$. Hint: first show that the
equivalence holds when $\alpha$ is the coordinate $x$.

64. The Heisenberg Method 

Having defined Heisenberg matrices through 2⁶², we derived in
the two preceding sections certain properties of these matrices,
properties that can be described without reference to the
Schroedinger theory. Now, it turns out that, if we should require a set
of matrices to have these properties, then the conditions imposed
on the matrices are so stringent that the matrices become uniquely
determined, apart from just the standard arbitrariness of the
phase factors of their elements. In other words, Heisenberg
matrices can be computed without reference to the Schroedinger
theory, and, in particular, to 2⁶². This fact enables us to treat
quantum-mechanical problems by the Heisenberg method, which
we shall now describe (insofar as it concerns the computation of
the possible values of real dynamical variables not involving t
explicitly and pertaining to a conservative system) by stating the
rules for associating specific operators with dynamical variables
and the rule for inferring the possible values of a dynamical
variable from the operator associated with it.

A. The Heisenberg operators to be associated with dynamical
variables are Hermitian matrices that fulfill the quantum conditions
1^* and satisfy the time-derivative, sum, and product relations satisfied
by the corresponding dynamical variables themselves; further, the
matrix associated with the Hamiltonian of the system is to be diagonal;
and lastly, the nmth element of every matrix, associated with a
dynamical variable not involving t explicitly, is to depend on t
through the time factor $e^{i(E_n - E_m)t/\hbar}$; the rule for time
differentiation of the matrices is therefore

\[ \dot{\alpha}_{nm} = \frac{i(E_n - E_m)}{\hbar}\,\alpha_{nm}, \tag{1} \]

where the $E$'s are the energy levels of the system, whose theoretical
values are of course usually unknown at the beginning of a
calculation.

B. Once the matrices satisfying the conditions A have been
found, the possible values of a dynamical variable are the eigenvalues
of the matrix associated with it. This is the fundamental physical
assumption of the Heisenberg method when this method is viewed
apart from our postulate I of §13.

The major computational problem in the application of the
Heisenberg method to a specific dynamical system is usually the
identification of the matrices to be associated with dynamical
variables, that is, matrices satisfying the conditions A. In
practice we usually first choose the matrix H to be a diagonal matrix
with undetermined diagonal elements, then associate with every
remaining dynamical variable a matrix whose elements are
undetermined except for the form of their time factors, and finally
force these matrices to satisfy the quantum conditions and the
relations holding between the corresponding dynamical variables
until their elements become fully determined, that is, until only
the standard arbitrariness in the phase factors remains. This
procedure must be carried out separately for each dynamical
system. Incidentally, since the matrix H is required to be
diagonal, it follows from B that the diagonal elements of the matrix H
are the energy levels of the system.

In the light of our previous work the conditions A are not un- 
expected; but the considerations that led Heisenberg in his pioneer 
work to formulate an essentially equivalent set of conditions were 
of course quite different from those discussed above. 

65. Energy Levels of an Oscillator

To illustrate the Heisenberg procedure we shall now use it to
compute the energy levels of a linear harmonic oscillator.
According to the conditions A of §64, the Heisenberg matrices $x$, $p$,
and $H$ must in this case satisfy the special relations

\[ m\ddot{x} = -kx \tag{1} \]

and

\[ H = \frac{1}{2m}p^2 + \tfrac{1}{2}kx^2, \tag{2} \]

and also the standard relations

\[ p = m\dot{x} \tag{3} \]

and

\[ xp - px = i\hbar; \tag{4} \]

further, the matrices must be Hermitian, the matrix $H$ must be
diagonal, and the time factors of the various matrix elements must
be of the standard form, so that, in particular,
$\dot{x}_{nl} = i(E_n - E_l)\hbar^{-1}x_{nl}$,

\[ \ddot{x}_{nl} = -(E_n - E_l)^2\hbar^{-2}x_{nl}, \tag{5} \]

and

\[ p_{nl} = mi(E_n - E_l)\hbar^{-1}x_{nl}. \tag{6} \]

As noted in the preceding section, it is a claim of the Heisenberg
theory that, if the matrices $x$, $p$, and $H$ satisfy these conditions,
then the diagonal elements of $H$ are the energy levels of the
oscillator.

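To simplify our problem, we shall assume outright that the
energy levels of the oscillator are nondegenerate, and shall label
them $E_0, E_1, E_2, \ldots$ in the order of increasing energy, $E_0$ being the
lowest level, so that the matrix $H$ has the form

\[ H = \begin{pmatrix}
E_0 & 0 & 0 & 0 & \cdots \\
0 & E_1 & 0 & 0 & \cdots \\
0 & 0 & E_2 & 0 & \cdots \\
0 & 0 & 0 & E_3 & \cdots \\
\cdots & \cdots & \cdots & \cdots & \cdots
\end{pmatrix}. \tag{7} \]

Now, the classical Fourier series for the coordinate of the oscillator
contains only two harmonics, namely, those numbered 1 and −1;
and, according to the correspondence rule, the elements $x_{n,n-r}$ of the
matrix $x$ are related to the classical rth harmonic of $x$. We shall
therefore assume that

\[ \begin{aligned}
x_{nl} &= 0 \quad \text{if } l \neq n \pm 1 \\
x_{nl} &\neq 0 \quad \text{if } l = n \pm 1,
\end{aligned} \tag{8} \]

so that the correspondence rule suggests that the matrix $x$ has the
form

\[ x = \begin{pmatrix}
0 & x_{01} & 0 & 0 & \cdots \\
x_{10} & 0 & x_{12} & 0 & \cdots \\
0 & x_{21} & 0 & x_{23} & \cdots \\
0 & 0 & x_{32} & 0 & \cdots \\
\cdots & \cdots & \cdots & \cdots & \cdots
\end{pmatrix}. \tag{9} \]

Our problem is thus to determine the elements of the matrices, in
particular those of the matrix $H$, in such a way that the matrices
satisfy the conditions listed above. The procedure is quite
straightforward. According to (1),

\[ m\ddot{x}_{n,n-1} = -kx_{n,n-1}, \tag{10} \]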

which, on account of (5), becomes $[k - m(E_n - E_{n-1})^2\hbar^{-2}]\,x_{n,n-1} = 0$;
therefore, in view of (8), $k - m(E_n - E_{n-1})^2\hbar^{-2} = 0$, and,
since $\nu_c = (1/2\pi)\sqrt{k/m}$,

\[ (E_n - E_{n-1})^2 = (h\nu_c)^2, \tag{11} \]

where $\nu_c$ is our usual abbreviation for the classical frequency of
the oscillator. Taking square roots and remembering that,
according to our labeling convention, $E_n$ is greater than $E_{n-1}$,
we get

\[ E_n - E_{n-1} = h\nu_c, \tag{12} \]

so that

\[ E_n = nh\nu_c + E_0. \tag{13} \]

Thus a great deal of information about the energy levels (their
spacing relative to each other) has been obtained almost at once.

To complete the computation of the energy levels, it remains to
evaluate $E_0$, that is, the matrix element $H_{00}$. Now, according
to (2),

\[ H_{00} = \frac{1}{2m}(p^2)_{00} + \tfrac{1}{2}k(x^2)_{00}, \tag{14} \]

where $(p^2)_{00}$ and $(x^2)_{00}$ are the 00th elements of the matrices $p^2$ and $x^2$;
hence

\[ H_{00} = \frac{1}{2m}\sum_i p_{0i}p_{i0} + \tfrac{1}{2}k\sum_i x_{0i}x_{i0}, \qquad i = 0, 1, 2, \ldots. \tag{15} \]

In view of (8) and (6), the only nonvanishing terms in the sums
go with $i = 1$, so that (15) reduces to

\[ H_{00} = \frac{1}{2m}p_{01}p_{10} + \tfrac{1}{2}kx_{01}x_{10}. \tag{16} \]

On account of (6), this becomes

\[ H_{00} = \frac{1}{2m}\,[im(E_0 - E_1)\hbar^{-1}x_{01}]\,[im(E_1 - E_0)\hbar^{-1}x_{10}] + \tfrac{1}{2}kx_{01}x_{10}, \tag{17} \]

and, using (13), we get

\[ H_{00} = 2\pi^2 m\nu_c^2\, x_{01}x_{10} + \tfrac{1}{2}kx_{01}x_{10} = kx_{01}x_{10}. \tag{18} \]



To evaluate $x_{01}x_{10}$ we turn to (4), which has not yet been
used. We have $(xp - px)_{00} = i\hbar$, that is,

\[ \sum_j x_{0j}p_{j0} - \sum_j p_{0j}x_{j0} = i\hbar, \qquad j = 0, 1, 2, \ldots. \tag{19} \]

The only value of $j$ yielding nonvanishing terms is again 1, so that

\[ x_{01}p_{10} - p_{01}x_{10} = i\hbar. \tag{20} \]

Using (6) and (13), we convert this into
$x_{01}(2\pi i m\nu_c x_{10}) - (-2\pi i m\nu_c x_{01})x_{10} = i\hbar$, and therefore

\[ x_{01}x_{10} = \frac{h}{8\pi^2 m\nu_c} = \frac{h}{4\pi\sqrt{km}}. \tag{21} \]

Equation (18) now yields $H_{00} = \tfrac{1}{2}h\nu_c$, and it follows from (13) that

\[ E_n = (n + \tfrac{1}{2})h\nu_c, \qquad n = 0, 1, 2, \ldots, \tag{22} \]

in agreement with the result obtained in §15 by the Schroedinger
method and in §51 by the p-method.
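The chain of results (21), (18), (13) can be followed with plain arithmetic. The sketch below is only an illustration with assumed inputs: the function name `oscillator_levels` is hypothetical, and the sample call uses units in which k = m = 1 and h = 2π, so that hν_c = 1.

```python
import math

def oscillator_levels(k, m, h, count):
    """Energy levels of the oscillator obtained by following equations (21), (18), (13)."""
    nu_c = math.sqrt(k / m) / (2.0 * math.pi)       # classical frequency
    x01_x10 = h / (8.0 * math.pi**2 * m * nu_c)     # equation (21)
    h00 = k * x01_x10                               # equation (18): H_00 = k x_01 x_10
    return [n * h * nu_c + h00 for n in range(count)]   # equation (13) with E_0 = H_00

levels = oscillator_levels(k=1.0, m=1.0, h=2.0 * math.pi, count=4)
```

In these units the list should come out as the half-integers ½, 3/2, 5/2, 7/2, in agreement with (22).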

The reader will have noted (a) that our acquaintance with the
classical theory of the oscillator, which was of no assistance in
computing the energy levels of the oscillator by the Schroedinger
method, has been of great help in the work of this section, since
it enabled us through the correspondence rule to determine from
the outset the general form of the Heisenberg matrices; and (b)
that the mathematical methods of this section, being on the whole
purely algebraic, are somewhat simpler than in the Schroedinger
case. For reasons such as these a person intimately acquainted
with the classical theory of a dynamical system and skilled in
judicious application of the correspondence rule will sometimes
prefer to make calculations regarding the corresponding
quantum-mechanical system directly by the Heisenberg method and without
reference to the Schroedinger theory.

Heisenberg matrices, other than $H$, pertaining to the oscillator
are also readily computed without the aid of Schroedinger
functions by extending the calculations given above. For example,
since $x$ must be a Hermitian matrix, we have $x_{10} = \bar{x}_{01}$, so that
$x_{01}x_{10} = |x_{01}|^2$, and (21) becomes

\[ |x_{01}|^2 = \tfrac{1}{2}a^2, \tag{23} \]

where $a$ is the constant introduced in §62. Hence the modulus of
$x_{01}$ is $2^{-1/2}a$, and $x_{01}$ differs from $2^{-1/2}a$ by a factor of the
form $e^{i\delta}$, where $\delta$ is real; this factor must include the standard
time factor of $x_{01}$ and must therefore be of the form
$e^{i\gamma}e^{i(E_0 - E_1)t/\hbar}$, where $\gamma$ remains arbitrary.
We may, if we wish, set $\gamma = \gamma_1 - \gamma_0$, where $\gamma_0$ and $\gamma_1$ are arbitrary
real constants, and write

\[ x_{01} = \frac{a}{\sqrt{2}}\, e^{-i(\gamma_0 - \gamma_1)}\, e^{i(E_0 - E_1)t/\hbar}. \tag{24} \]

This result is in agreement with the middle line in 8⁶² for $n = 0$.
The calculation of the matrix $x$ will be completed in Exercises
2 and 3.

The computation of the Heisenberg matrices for the oscillator
by the Heisenberg method becomes of course more involved when
carried out without the aid of the correspondence rule. On the
assumption that the energy spectrum is nondegenerate, it is not
difficult to show that every row or column of the matrix $x$
contains, at most, two nonvanishing elements, but the identification
of the positions of these is somewhat tedious.*

Exercises 

1. Derive (13) by equating the (n, n + 1)th elements [rather than the
(n, n − 1)th elements] of the two sides of (1).

2. Equate the nth diagonal elements of the two sides of (4) and show,
using (8), that $|x_{n,n+1}|^2 - |x_{n,n-1}|^2 = \tfrac{1}{2}a^2$; then show that
$|x_{n,n+1}|^2 = \tfrac{1}{2}(n + 1)a^2$.

3. Deduce 8⁶² from the final result of Exercise 2.

4. Equate the nlth elements of the two sides of (1) and show, assuming
nondegeneracy of the energy spectrum, but not using the correspondence
rule, that every row as well as every column of the matrix $x$ for the
oscillator contains at most two nonvanishing elements.

5. Recall Exercise 6*® and show that, since (4) holds for any dynamical
system, all Heisenberg matrices have an infinite number of rows and
columns.

6. Equate the nnth elements of the two sides of (4) and show that every
row as well as every column of the Heisenberg matrix $x$ for any dynamical
system contains at least one nonvanishing element.


* See, for example, G. Birtwistle, The New Quantum Mechanics, Chapter
X; Cambridge: The University Press, 1928. Or M. Born and P. Jordan,
Elementare Quantenmechanik, §23; Berlin: Springer, 1930. Born and
Jordan consider the problem with a minimum of assumptions and show that
the Heisenberg method leads to unique matrices for the oscillator.





66. Diagonalization of Matrices 

In this and in the following section we shall take up certain 
mathematical preliminaries to the Heisenberg perturbation theory. 

Frame changes. If we begin with an analytical description of
vectors and mappings of a space in terms of some coordinate
frame P and then go over to their description in terms of another
frame Q, we say that we have carried out a transformation of
coordinates or a frame change. Our immediate problem is to set up
a convenient analytical method for describing frame changes.


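We start with a two-dimensional example and indicate in Fig. 98 a plane, an arbitrary vector u in this plane, and the two coordinate frames P and Q of Fig. 87®« and Fig. 91«; the process of substituting the frame Q for the frame P will be called for brevity the frame change S.

A frame change (unlike a mapping) does not affect the vectors of our plane; for example, the vector u in Fig. 98 remains the same whatever frame may be used in describing the plane. The components of u, however, depend on the frame: its components $u_1^Q$ and $u_2^Q$ in the frame Q are related to its components $u_1^P$ and $u_2^P$ in the frame P through the equations

$$u_1^Q = \frac{1}{\sqrt{2}}\,u_1^P + \frac{1}{\sqrt{2}}\,u_2^P, \qquad u_2^Q = -\frac{1}{\sqrt{2}}\,u_1^P + \frac{1}{\sqrt{2}}\,u_2^P. \tag{1}$$

If we introduce the symbol

$$S = \begin{pmatrix} \dfrac{1}{\sqrt{2}} & \dfrac{1}{\sqrt{2}} \\[2mm] -\dfrac{1}{\sqrt{2}} & \dfrac{1}{\sqrt{2}} \end{pmatrix} \tag{2}$$

for the coefficients on the right side of (1), let $u^P$ denote a column symbol with components $u_1^P$ and $u_2^P$, and let $u^Q$ denote a column symbol with components $u_1^Q$ and $u_2^Q$, we may now symbolize Equations (1) by writing $u^Q = Su^P$.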




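If u is an arbitrary vector in an N-space, and if its components $u_i^Q$ in a frame Q are related to its components $u_i^P$ in a frame P through the equations

$$\begin{aligned}
u_1^Q &= S_{11}u_1^P + S_{12}u_2^P + \cdots + S_{1N}u_N^P \\
u_2^Q &= S_{21}u_1^P + S_{22}u_2^P + \cdots + S_{2N}u_N^P \\
&\;\;\vdots \\
u_N^Q &= S_{N1}u_1^P + S_{N2}u_2^P + \cdots + S_{NN}u_N^P,
\end{aligned} \tag{3}$$

then we adopt the matrix

$$S = \begin{pmatrix} S_{11} & S_{12} & \cdots & S_{1N} \\ S_{21} & S_{22} & \cdots & S_{2N} \\ \vdots & \vdots & & \vdots \\ S_{N1} & S_{N2} & \cdots & S_{NN} \end{pmatrix} \tag{4}$$

as the analytical description of the transformation from the frame P to the frame Q and, denoting the column symbol with components $u_i^Q$ by $u^Q$ and the column symbol with components $u_i^P$ by $u^P$, we symbolize Equations (3) by writing

$$u^Q = Su^P. \tag{5}$$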

The effect of frame changes on representatives of mappings. Let a mapping α of a space send an arbitrary vector u into the vector v, so that

$$\alpha u = v, \tag{6}$$

and let this mapping and the vectors u and v be represented in a frame P by the matrix $\alpha^P$ and column symbols $u^P$ and $v^P$, respectively, so that

$$\alpha^P u^P = v^P; \tag{7}$$

further, let this mapping and the vectors u and v be represented in another frame Q by the matrix $\alpha^Q$ and column symbols $u^Q$ and $v^Q$, respectively, so that

$$\alpha^Q u^Q = v^Q. \tag{8}$$

Our problem is to find the relation between $\alpha^Q$ and $\alpha^P$ when the transformation from the frame P to the frame Q is described by a matrix S, so that

$$u^Q = Su^P \tag{9}$$

$$v^Q = Sv^P. \tag{10}$$





Eliminating $u^Q$ and $v^Q$ from (8) by means of (9) and (10), we get

$$\alpha^Q S u^P = S v^P, \tag{11}$$

that is, in view of (7),

$$\alpha^Q S u^P = S \alpha^P u^P. \tag{12}$$

Since the vector u is arbitrary, the column symbol $u^P$ in (12) has arbitrary components; recalling Exercise 8^*, we then conclude that

$$\alpha^Q S = S \alpha^P. \tag{13}$$

If $S^{-1}$ exists, we get from (13) the equation

$$\alpha^Q = S \alpha^P S^{-1}, \tag{14}$$

which enables us to compute $\alpha^Q$ in terms of $\alpha^P$ and S.


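To illustrate, we return for a moment to the reflection γ discussed in §64; this reflection is represented in the frame P of Fig. 98 by the matrix

$$\gamma^P = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}. \tag{15}$$

We shall use (14) to find the matrix $\gamma^Q$ representing the same mapping but constructed relatively to the frame Q. The frame change from P to Q is in this case described by the matrix (2), so that according to 15®®

$$S^{-1} = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 & -1 \\ 1 & 1 \end{pmatrix}. \tag{16}$$

Equation (14) now yields

$$\gamma^Q = S\gamma^P S^{-1} = \frac{1}{2}\begin{pmatrix} 1 & 1 \\ -1 & 1 \end{pmatrix}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} 1 & -1 \\ 1 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \tag{17}$$

in agreement with 7®®.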
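The computation in this illustration can be checked numerically. The sketch below assumes that the frame change S is the 45-degree rotation with rows (1/√2, 1/√2) and (−1/√2, 1/√2), and that the reflection interchanges the two components of a vector in the frame P; the helper names `matmul` and `inverse_2x2` are illustrative and do not come from the text.

```python
from math import sqrt

def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def inverse_2x2(m):
    """Reciprocal of a 2-by-2 matrix (assumes a nonzero determinant)."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

r = 1 / sqrt(2)
S = [[r, r], [-r, r]]        # assumed frame change from P to Q
gamma_P = [[0, 1], [1, 0]]   # assumed reflection matrix in the frame P

# Equation (14): the representative of the same mapping in the frame Q.
gamma_Q = matmul(matmul(S, gamma_P), inverse_2x2(S))
print(gamma_Q)
```

Up to rounding, the printed matrix is diagonal with the eigenvalues 1 and −1 of the reflection on its diagonal.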


From the quasi-geometric standpoint that we have employed in this section, it is of course obvious that the matrices $\alpha^P$ and $\alpha^Q$ have the same eigenvalues, namely, the eigenvalues of the one mapping that both matrices represent. To prove (algebraically, and without ascribing to the matrices any geometrical significance) that, if the matrices $\alpha^Q$, $\alpha^P$, and S satisfy (14), then every eigenvalue of $\alpha^P$ is also an eigenvalue of $\alpha^Q$, we may proceed as follows: Let $\alpha^P$, the constant λ, and the column symbol $u^P$ satisfy the equation

$$\alpha^P u^P = \lambda u^P, \tag{18}$$

so that λ is an eigenvalue of $\alpha^P$. Construct the column symbol $Su^P$, denote it, say, by $u^Q$, and operate on it with the matrix $\alpha^Q$. The result is

$$\alpha^Q u^Q = \alpha^Q S u^P = S\alpha^P S^{-1} S u^P = S\alpha^P u^P, \tag{19}$$

so that, in view of (18), $\alpha^Q u^Q = S\lambda u^P = \lambda S u^P$, that is,

$$\alpha^Q u^Q = \lambda u^Q, \tag{20}$$

and hence λ is an eigenvalue of $\alpha^Q$.

If three matrices, α′, α, and β, are related through the equation

$$\alpha' = \beta\alpha\beta^{-1}, \tag{21}$$

that is, through an equation of the form (14), then α′ is called the transform of α by β, β is called the transformation matrix, and the process of constructing α′ from α and β is called a similarity transformation; similarly, the matrix $\alpha'' = \gamma\alpha\gamma^{-1}$ is called the transform of α by γ, and so on. We have shown above, quite apart from any quasi-geometric significance that may be attached to matrices, that the eigenvalues of a transform of a matrix are the same as those of the matrix itself. This result provides the clue for our interest in the topics now under discussion, because it shows that the computation of the eigenvalues of a matrix can, by a similarity transformation of the given matrix, be reduced to the computation of the eigenvalues of another matrix, which might perhaps be easier to handle than the given matrix.
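As a small check of this invariance, the sketch below transforms an arbitrary 2-by-2 matrix by an arbitrary (not unitary) transformation matrix and compares the trace and the determinant, which together fix the two eigenvalues of a 2-by-2 matrix. The particular numbers and the helper names are assumptions chosen for illustration only.

```python
from fractions import Fraction as F

def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def inverse_2x2(m):
    """Reciprocal of a 2-by-2 matrix (assumes a nonzero determinant)."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def trace_and_det(m):
    """The two coefficients of the secular equation of a 2-by-2 matrix."""
    (a, b), (c, d) = m
    return (a + d, a * d - b * c)

alpha = [[F(1), F(2)], [F(3), F(4)]]   # arbitrary matrix
beta = [[F(2), F(1)], [F(5), F(3)]]    # arbitrary nonsingular transformation matrix

# The transform of alpha by beta, as in (21).
alpha_prime = matmul(matmul(beta, alpha), inverse_2x2(beta))
print(trace_and_det(alpha), trace_and_det(alpha_prime))
```

The transform looks quite different from the original matrix, yet both pairs printed are the same, so the two secular equations (and hence the eigenvalues) coincide.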

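Unitary frame changes. Let u, v, ···, z be N orthonormal vectors in an N-space, represented in a frame P by the column symbols

$$u^P = \begin{pmatrix} u_1 \\ u_2 \\ \vdots \\ u_N \end{pmatrix}, \quad v^P = \begin{pmatrix} v_1 \\ v_2 \\ \vdots \\ v_N \end{pmatrix}, \quad \cdots, \quad z^P = \begin{pmatrix} z_1 \\ z_2 \\ \vdots \\ z_N \end{pmatrix}, \tag{22}$$

so that we have the N equations (recall 13'’ and 14")

$$\sum_i u_i^* u_i = \sum_i v_i^* v_i = \cdots = \sum_i z_i^* z_i = 1 \tag{23}$$

and the $\tfrac{1}{2}N(N-1)$ equations

$$\begin{aligned}
\sum_i u_i^* v_i = \sum_i u_i^* w_i = \cdots = \sum_i u_i^* z_i &= 0 \\
\sum_i v_i^* w_i = \cdots = \sum_i v_i^* z_i &= 0 \\
\cdots\cdots & \\
\sum_i y_i^* z_i &= 0,
\end{aligned} \tag{24}$$

and let S be the special frame change described by the matrix

$$S = \begin{pmatrix} u_1^* & u_2^* & \cdots & u_N^* \\ v_1^* & v_2^* & \cdots & v_N^* \\ \vdots & \vdots & & \vdots \\ z_1^* & z_2^* & \cdots & z_N^* \end{pmatrix}. \tag{25}$$

Our present problem is to identify the frame Q that the special frame change S substitutes for the frame P.

Operating with the matrix S in turn on each of the symbols (22), we get, in view of (23) and (24), the column symbols

$$\begin{pmatrix} 1 \\ 0 \\ \vdots \\ 0 \end{pmatrix}, \quad \begin{pmatrix} 0 \\ 1 \\ \vdots \\ 0 \end{pmatrix}, \quad \cdots, \quad \begin{pmatrix} 0 \\ 0 \\ \vdots \\ 1 \end{pmatrix}. \tag{26}$$

Now, according to (5), the symbols (26) are the representatives in the frame Q of our vectors u, v, ···, z; and inspection of (26) shows that Q is that frame whose basic unit vectors lying along the first, the second, ···, the Nth axes are just our unit vectors u, v, ···, z. The scheme (25) thus enables us to construct the matrix describing the transformation from one orthogonal frame to another orthogonal frame that has a prescribed orientation relative to the first.

A transformation from one orthogonal frame to another orthogonal frame and the matrix describing such a transformation are called, respectively, a unitary frame change and a unitary matrix.

The complex conjugate of (25) is

$$\tilde{S} = \begin{pmatrix} u_1 & v_1 & \cdots & z_1 \\ u_2 & v_2 & \cdots & z_2 \\ \vdots & \vdots & & \vdots \\ u_N & v_N & \cdots & z_N \end{pmatrix}, \tag{27}$$

and we find, using (23) and (24), that

$$S\tilde{S} = 1, \tag{28}$$

that is, that

$$S^{-1} = \tilde{S}. \tag{29}$$

Equation (29), which can be shown to imply the $\tfrac{1}{2}N(N+1)$ equations (23) and (24), provides a succinct algebraic definition of a unitary matrix: a matrix is unitary if its complex conjugate equals its reciprocal.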
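The construction of a unitary matrix from orthonormal vectors can be tried out on a small case. The sketch below assumes two particular orthonormal vectors with complex components (chosen only for illustration), builds S with the complex conjugates of their components as rows, and forms the product of S with its transposed conjugate; `dagger` is a hypothetical helper name for what the text calls the complex conjugate of a matrix.

```python
from math import sqrt

def matmul(a, b):
    """Product of two matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def dagger(m):
    """Conjugate every element and transpose."""
    return [[m[i][j].conjugate() for i in range(len(m))]
            for j in range(len(m[0]))]

r = 1 / sqrt(2)
u = [complex(r, 0), complex(0, r)]    # assumed unit vector
v = [complex(r, 0), complex(0, -r)]   # assumed unit vector orthogonal to u

# Rows of S are the conjugated components of u and v.
S = [[x.conjugate() for x in u], [x.conjugate() for x in v]]

product = matmul(S, dagger(S))             # should be the unit matrix
image_of_u = matmul(S, [[x] for x in u])   # should be the column (1, 0)
print(product)
print(image_of_u)
```

Up to rounding, the product is the unit matrix, and S sends u into the first basic column symbol.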

Diagonalization of Hermitian matrices. Let a mapping α be represented in a frame P by a Hermitian matrix

$$\alpha^P = \begin{pmatrix} \alpha_{11} & \cdots & \alpha_{1N} \\ \vdots & & \vdots \\ \alpha_{N1} & \cdots & \alpha_{NN} \end{pmatrix} \tag{30}$$

and let Q be an orthogonal frame whose axes lie along the eigenvectors of α. We shall show that the matrix $\alpha^Q$ representing the mapping α in the frame Q is diagonal.

Let the unitary eigenvectors* of α belonging to its eigenvalues $\lambda_1, \lambda_2, \cdots, \lambda_N$ be represented in the frame P by the column symbols (22), so that

$$\alpha^P u^P = \lambda_1 u^P \tag{31a}$$

$$\alpha^P v^P = \lambda_2 v^P \tag{31b}$$

and so forth, or, more explicitly,

$$\sum_i \alpha_{ki} u_i = \lambda_1 u_k, \qquad \sum_i \alpha_{ki} v_i = \lambda_2 v_k, \qquad \cdots, \tag{32}$$


* As noted in §68, any two eigenvectors of a Hermitian matrix belonging to different eigenvalues of the matrix are orthogonal. If $\alpha^P$ is nondegenerate, the mapping α has just N eigenrays, and each of its normalized eigenvectors is undetermined only to the extent of a numerical factor of modulus unity, which we absorb in the elements of (22). If $\alpha^P$ is degenerate, the mapping α has an infinite number of eigenrays; we then restrict ourselves to eigenvectors lying along N mutually orthogonal eigenrays; it is immaterial for our purpose how these N rays are chosen.



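where k = 1, 2, ···, N. Our task is to find, with the help of (32), the matrix representing α in the frame whose basic unit vectors are represented in the frame P by the column symbols (22), so that the transformation matrix and its reciprocal are just (25) and (27); Equations (24) are of course consistent with the fact that α is Hermitian. Using (14), we get

$$\alpha^Q = \begin{pmatrix} u_1^* & \cdots & u_N^* \\ \vdots & & \vdots \\ z_1^* & \cdots & z_N^* \end{pmatrix} \begin{pmatrix} \alpha_{11} & \cdots & \alpha_{1N} \\ \vdots & & \vdots \\ \alpha_{N1} & \cdots & \alpha_{NN} \end{pmatrix} \begin{pmatrix} u_1 & \cdots & z_1 \\ \vdots & & \vdots \\ u_N & \cdots & z_N \end{pmatrix} \tag{33}$$

$$= \begin{pmatrix} u_1^* & \cdots & u_N^* \\ \vdots & & \vdots \\ z_1^* & \cdots & z_N^* \end{pmatrix} \begin{pmatrix} \sum_i \alpha_{1i}u_i & \cdots & \sum_i \alpha_{1i}z_i \\ \vdots & & \vdots \\ \sum_i \alpha_{Ni}u_i & \cdots & \sum_i \alpha_{Ni}z_i \end{pmatrix}, \tag{34}$$

and hence, in view of (32),

$$\alpha^Q = \begin{pmatrix} u_1^* & \cdots & u_N^* \\ \vdots & & \vdots \\ z_1^* & \cdots & z_N^* \end{pmatrix} \begin{pmatrix} \lambda_1 u_1 & \cdots & \lambda_N z_1 \\ \vdots & & \vdots \\ \lambda_1 u_N & \cdots & \lambda_N z_N \end{pmatrix}, \tag{35}$$

that is, in view of (23) and (24),

$$\alpha^Q = \begin{pmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & \lambda_N \end{pmatrix}. \tag{36}$$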


The process of subjecting a matrix to a similarity transformation such that the transform of the matrix is diagonal is called diagonalization of the matrix or, even in purely algebraic discussions, transformation of the matrix to its principal axes. A similarity transformation employing a unitary transformation matrix is called a unitary transformation. We have shown above that a Hermitian matrix can be diagonalized by means of a unitary transformation, and that the appropriate transformation matrix can be constructed as soon as the eigenvectors of the given matrix have been computed.


To illustrate, we shall diagonalize the matrix $\gamma^P$ given by (15). According to Table 12®^ the normalized eigenvectors of $\gamma^P$ belonging to its respective eigenvalues 1 and −1 are $\frac{1}{\sqrt{2}}e^{i\delta_1}\begin{pmatrix} 1 \\ 1 \end{pmatrix}$ and $\frac{1}{\sqrt{2}}e^{i\delta_2}\begin{pmatrix} 1 \\ -1 \end{pmatrix}$, where the δ's are arbitrary real constants whose values are immaterial for our purpose; to save writing, we let $\delta_1 = 0$ and $\delta_2 = \pi$, getting the particular unitary eigenvectors

$$\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix}, \qquad \frac{1}{\sqrt{2}}\begin{pmatrix} -1 \\ 1 \end{pmatrix}. \tag{37}$$

Using (37) to construct the transformation matrix S after the manner in which (25) is constructed from (22), we find that S is just the matrix (2). That the transform of $\gamma^P$ by this S is diagonal is seen from (17).


Summary of algebraic results. So far as strict matrix algebra 
is concerned, we defined in this section the terms transform of a 
matrix, transformation matrix, similarity transformation, unitary 
matrix, unitary transformation, and diagonalization of a matrix; 
and, restricting ourselves to matrices with a finite number of rows 
and columns, we proved algebraically that the eigenvalues of a 
transform of a matrix are the same as those of the matrix itself, 
and that a Hermitian matrix can be diagonalized by means of a 
unitary transformation. 

The process of diagonalization throws a matrix into a form in 
which its eigenvalues are displayed in full view (Exercise 3®^); 
nevertheless, in most cases, this process is not useful for the 
computation of eigenvalues, because to set up the appropriate 
transformation matrix we must know the eigenvectors of the 
given matrix, while to compute the eigenvectors it is necessary 
first to find the eigenvalues. But the process of diagonalization 
is of computational value in certain approximate calculations to be 
considered in the next section. 


Exercises 


1. Show that a counterclockwise rotation of a rectangular coordinate frame of a real plane through an angle φ about O is described by the matrix

$$\begin{pmatrix} \cos\varphi & \sin\varphi \\ -\sin\varphi & \cos\varphi \end{pmatrix}.$$

Contrast this matrix with that of Exercise 5®®.






2. Use (14) to compute each of the matrices 7®® from the corresponding matrix in 16*^.

3. Diagonalize the matrix of Exercise 8®*.

4. Show that every algebraic relation that holds between two or more matrices holds also between the respective transforms of these matrices, provided each matrix is subjected to the same similarity transformation.

5. Show that, if two matrices do not commute, then they cannot be diagonalized simultaneously, that is, by means of the same similarity transformation.

6. The transformation from a frame P to a frame Q is described by a matrix S; the transformation from the frame Q to a frame R is described by a matrix P; compute the matrix describing the transformation from the frame P directly to the frame R.

7. Show that a product of two unitary matrices is unitary.

8. Show that the transform of a Hermitian matrix by a unitary matrix is Hermitian.

9. Show that the elements of (25) satisfy the equations

$$u_k^* u_l + v_k^* v_l + \cdots + z_k^* z_l = \begin{cases} 1 & \text{if } k = l \\ 0 & \text{if } k \neq l. \end{cases} \tag{38}$$

Note the appropriateness of saying that the rows of a unitary matrix are orthonormal, and that so are its columns.

10. Show that, if a unitary matrix S commutes with a nondegenerate diagonal matrix, then S is diagonal and each of its diagonal elements is of modulus unity.

11. Diagonalize the matrix in 15®*; the procedure developed in the 
text is applicable in this case, since a Hermitian matrix can be formed by 
multiplying by Reconcile the result of the diagonalization with 
that of Exercise 6®®. 

12. Show that the condition (29), when imposed on the matrix (27), implies the $\tfrac{1}{2}N(N+1)$ equations (23) and (24).


67. Approximate Diagonalization of Matrices 

If a matrix α has the form

$$\alpha = \alpha^0 + \epsilon\alpha' + \epsilon^2\alpha'' + \cdots, \tag{1}$$

where $\alpha^0$ is diagonal, so that

$$\alpha^0_{nm} = 0 \quad \text{if } n \neq m, \tag{2}$$

where $\alpha^0$, α′, α″, and so on, are matrices not involving ε, and where ε is a sufficiently small numerical parameter, then we call α a perturbed matrix, call $\alpha^0$ the unperturbed matrix, and call $\epsilon\alpha' + \epsilon^2\alpha'' + \cdots$ the perturbing matrix or the perturbation. Thus, if ε is small, then the matrix

$$\begin{pmatrix} 1 + 3\epsilon & 2\epsilon \\ 2\epsilon & 4 + \epsilon^2 \end{pmatrix} \tag{3}$$

is an example of a perturbed 2-by-2 matrix, the unperturbed matrix being $\alpha^0 = \begin{pmatrix} 1 & 0 \\ 0 & 4 \end{pmatrix}$ and the perturbation being $\epsilon\alpha' + \epsilon^2\alpha'' = \epsilon\begin{pmatrix} 3 & 2 \\ 2 & 0 \end{pmatrix} + \epsilon^2\begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}$. The purpose of this section is to show how matrix diagonalization can be used for approximate calculation of the eigenvalues of a Hermitian perturbed matrix without explicit use of the secular equation.


We begin with a few remarks concerning the eigenvalues of the particular matrix (3). If ε is entirely negligible, then (3) reduces to the unperturbed matrix

$$\begin{pmatrix} 1 & 0 \\ 0 & 4 \end{pmatrix} \tag{4}$$

whose eigenvalues are 1 and 4; thus to the zeroth approximation the eigenvalues of (3) are 1 and 4. If ε is small enough to permit neglecting the terms in ε², but not small enough to permit neglecting the terms in ε, then (3) goes over into

$$\begin{pmatrix} 1 + 3\epsilon & 2\epsilon \\ 2\epsilon & 4 \end{pmatrix}; \tag{5}$$

this matrix is not diagonal, and we are not prepared to determine its eigenvalues by inspection.

Let us now compute, correctly to terms* in ε², the transform $S\alpha S^{-1}$ of (3) by the special matrix

$$S = 1 + \epsilon S', \quad \text{where} \quad S' = \begin{pmatrix} 0 & -\tfrac{2}{3} \\ \tfrac{2}{3} & 0 \end{pmatrix}; \tag{6}$$

the reason for choosing this particular S will appear presently. Since ε is small, the reciprocal of (6) can be found by the formula $(1 + \delta)^{-1} = 1 - \delta + \delta^2 - \cdots$, so that, correctly to terms in ε², we have $S^{-1} = 1 - \epsilon S' + \epsilon^2 S'^2$, that is,

$$S^{-1} = \begin{pmatrix} 1 - \tfrac{4}{9}\epsilon^2 & \tfrac{2}{3}\epsilon \\ -\tfrac{2}{3}\epsilon & 1 - \tfrac{4}{9}\epsilon^2 \end{pmatrix}. \tag{7}$$

* 'Correctly to terms in ε²' means that the terms in ε⁰, ε¹, and ε² are correct, while the terms in ε³, ε⁴, and so on, are not necessarily correct.

Thus

$$S\alpha S^{-1} = \begin{pmatrix} 1 & -\tfrac{2}{3}\epsilon \\ \tfrac{2}{3}\epsilon & 1 \end{pmatrix} \begin{pmatrix} 1 + 3\epsilon & 2\epsilon \\ 2\epsilon & 4 + \epsilon^2 \end{pmatrix} \begin{pmatrix} 1 - \tfrac{4}{9}\epsilon^2 & \tfrac{2}{3}\epsilon \\ -\tfrac{2}{3}\epsilon & 1 - \tfrac{4}{9}\epsilon^2 \end{pmatrix}. \tag{8}$$

When we carry out the matrix multiplication and neglect powers higher than ε² in the result, (8) reduces to

$$\begin{pmatrix} 1 + 3\epsilon - \tfrac{4}{3}\epsilon^2 & 2\epsilon^2 \\ 2\epsilon^2 & 4 + \tfrac{7}{3}\epsilon^2 \end{pmatrix}. \tag{9}$$

Comparison of (9) with (3) shows that the effect of our transformation has been to cause linear terms in ε to appear only in the diagonal elements or, as we say for short, to diagonalize (3) to terms in ε. If ε² can be neglected, (9) becomes the diagonal matrix

$$\begin{pmatrix} 1 + 3\epsilon & 0 \\ 0 & 4 \end{pmatrix} \tag{10}$$

whose eigenvalues are 1 + 3ε and 4. Recalling that the eigenvalues of a transform of a matrix are the same as those of the matrix itself, we conclude that, correctly to terms in ε, the eigenvalues of (3) are also 1 + 3ε and 4.

Since terms in ε² appear off the diagonal in (9), we are not prepared to determine by inspection the eigenvalues of α correctly to terms in ε². A similarity transformation can, however, be found which diagonalizes (3) to terms in ε², and then the eigenvalues can be obtained by inspection correctly to terms in ε². The remainder of this section will deal with a systematic procedure for such approximate computations.
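The effect of this transformation can be checked with exact rational arithmetic. The sketch below assumes that the matrix (3) has the elements 1 + 3ε, 2ε, 2ε, 4 + ε², and that S = 1 + εS′ with S′ having the off-diagonal elements −2/3 and 2/3; it uses the exact reciprocal of S rather than the truncated series, so the results agree with the truncated ones only up to terms in ε³. The helper names are illustrative.

```python
from fractions import Fraction

def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def inverse_2x2(m):
    """Exact reciprocal of a 2-by-2 matrix (assumes a nonzero determinant)."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def transformed(eps):
    """Transform of the assumed matrix (3) by S = 1 + eps*S'."""
    alpha = [[1 + 3 * eps, 2 * eps], [2 * eps, 4 + eps ** 2]]
    a = Fraction(2, 3) * eps
    S = [[Fraction(1), -a], [a, Fraction(1)]]
    return matmul(matmul(S, alpha), inverse_2x2(S))

eps = Fraction(1, 100)
t = transformed(eps)
# Off-diagonal element before (2*eps) and after (close to 2*eps**2).
print(float(2 * eps), float(t[0][1]), float(2 * eps ** 2))
```

The off-diagonal elements drop from the order of ε to the order of ε², which is what is meant by diagonalizing the matrix to terms in ε.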


Let S be an as yet undetermined unitary matrix of the form

$$S = S^0 + \epsilon S' + \epsilon^2 S'' + \cdots, \tag{11}$$

where the superscripted S's are matrices independent of ε; let α be our perturbed Hermitian matrix (1); and let us try to adjust S in such a way that the matrix

$$A = S\alpha S^{-1} \tag{12}$$

is diagonal, that is, that

$$A_{nm} = 0 \quad \text{if } n \neq m. \tag{13}$$

Since A is a diagonal transform of α, the diagonal elements of A are the eigenvalues of α, so that if these eigenvalues are appropriately numbered, we have

$$A_{nn} = \lambda_n. \tag{14}$$

We assume that each eigenvalue is a power series in ε, so that A has the form

$$A = A^0 + \epsilon A' + \epsilon^2 A'' + \cdots, \tag{15}$$

where the superscripted A's are diagonal matrices independent of ε.

To find how S is to be chosen, we multiply (12) by S from the right, getting

$$AS = S\alpha, \tag{16}$$

that is,

$$(A^0 + \epsilon A' + \epsilon^2 A'' + \cdots)(S^0 + \epsilon S' + \epsilon^2 S'' + \cdots) = (S^0 + \epsilon S' + \epsilon^2 S'' + \cdots)(\alpha^0 + \epsilon\alpha' + \epsilon^2\alpha'' + \cdots), \tag{17}$$

and combine like powers of ε:

$$A^0S^0 + \epsilon(A^0S' + A'S^0) + \epsilon^2(A^0S'' + A'S' + A''S^0) + \cdots = S^0\alpha^0 + \epsilon(S^0\alpha' + S'\alpha^0) + \epsilon^2(S^0\alpha'' + S'\alpha' + S''\alpha^0) + \cdots. \tag{18}$$

Thus, if (16) is to hold identically in ε, the following sequence of matrix equations must be satisfied:

$$A^0S^0 = S^0\alpha^0 \tag{19}$$

$$A^0S' + A'S^0 = S^0\alpha' + S'\alpha^0 \tag{20}$$

$$A^0S'' + A'S' + A''S^0 = S^0\alpha'' + S'\alpha' + S''\alpha^0 \tag{21}$$

and so on. The only quantities in these equations which are known from the outset are the α's. The equations, however, enable us to set up one after another the matrices $A^0$, A′, A″, and so on, and, through the A's, to determine the eigenvalues of α correctly to any desired power of ε.

We first turn to (19). If ε → 0, then α → $\alpha^0$ and A → $A^0$, so that, since the eigenvalues of A are the same as those of α, the eigenvalues of $A^0$ must be the same as those of $\alpha^0$; and, since both $A^0$ and $\alpha^0$ are diagonal, these matrices can differ only in the order of their diagonal elements. We can without loss of generality require that the diagonal elements of $A^0$ be arranged in the same order as those of $\alpha^0$, in which case

$$A^0 = \alpha^0, \tag{22}$$

and (19) becomes

$$\alpha^0 S^0 = S^0\alpha^0. \tag{23}$$

Now, if ε → 0, then S → $S^0$, so that, since S is to be unitary for any value of ε, $S^0$ must itself be unitary. It now follows from (23) and the result of Exercise 10⁶⁶ that, if the unperturbed matrix $\alpha^0$ is nondegenerate, then $S^0$ is a diagonal matrix each of whose diagonal elements is of modulus unity. We adopt the simplest possibility, namely

$$S^0 = 1. \tag{24}$$

We shall see presently that when $\alpha^0$ is degenerate this choice of the unit matrix for $S^0$ is not satisfactory.

Having determined $A^0$ and $S^0$, we turn to (20), which now has the form $\alpha^0 S' + A' = \alpha' + S'\alpha^0$, that is,

$$\alpha^0 S' - S'\alpha^0 = \alpha' - A'. \tag{25}$$

Equating the nnth elements of the two sides of (25), we get

$$\sum_i \left(\alpha^0_{ni}S'_{in} - S'_{ni}\alpha^0_{in}\right) = \alpha'_{nn} - A'_{nn}. \tag{26}$$

Since $\alpha^0$ is diagonal, the left side of (26) is $\alpha^0_{nn}S'_{nn} - S'_{nn}\alpha^0_{nn} = 0$, and hence

$$A'_{nn} = \alpha'_{nn}. \tag{27}$$

If powers of ε higher than the first are neglected, (15) reduces to $A = A^0 + \epsilon A'$, so that (14) becomes $\lambda_n = A^0_{nn} + \epsilon A'_{nn}$, or, according to (22) and (27),

$$\lambda_n = \alpha^0_{nn} + \epsilon\alpha'_{nn}. \tag{28}$$

Thus, if the unperturbed matrix is nondegenerate, then the diagonal elements of the perturbed matrix give the eigenvalues of the perturbed matrix correctly to terms in the first power of ε. For example, we can now tell by inspection that the eigenvalues of (3), correct to terms in ε, are 1 + 3ε and 4.

A calculation of S′ from (20), followed by a calculation of A″ from (21), yields the next approximation for the eigenvalues:

$$\lambda_n = \alpha^0_{nn} + \epsilon\alpha'_{nn} + \epsilon^2\left[\alpha''_{nn} + {\sum_i}' \frac{\alpha'_{ni}\alpha'_{in}}{\alpha^0_{nn} - \alpha^0_{ii}}\right], \tag{29}$$

where i runs through the values 1, 2, ···, n − 1, n + 1, ···, N; the prime after the Σ is a reminder that i skips the value n.
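The second-order formula can be compared with the precise eigenvalues in a simple case. The sketch below assumes the standard form of the formula (unperturbed diagonal element, plus ε times the diagonal element of the first-order perturbation, plus ε² times the sum of the diagonal element of the second-order perturbation and the primed sum), and applies it to the 2-by-2 example with elements 1 + 3ε, 2ε, 2ε, 4 + ε². The function names and the value ε = 0.01 are illustrative choices, not taken from the text.

```python
from math import sqrt

def second_order_eigenvalues(a0, a1, a2, eps):
    """Eigenvalues of a0 + eps*a1 + eps**2*a2, correct to terms in eps**2.

    a0 must be diagonal and nondegenerate; a1 and a2 are the perturbations.
    """
    n = len(a0)
    values = []
    for k in range(n):
        # Primed sum: the index i skips the value k.
        s = sum(a1[k][i] * a1[i][k] / (a0[k][k] - a0[i][i])
                for i in range(n) if i != k)
        values.append(a0[k][k] + eps * a1[k][k] + eps ** 2 * (a2[k][k] + s))
    return values

def exact_eigenvalues_2x2(m):
    """Roots of the secular equation of a real symmetric 2-by-2 matrix."""
    (a, b), (c, d) = m
    tr, det = a + d, a * d - b * c
    root = sqrt(tr * tr - 4 * det)
    return [(tr - root) / 2, (tr + root) / 2]

eps = 0.01
a0 = [[1, 0], [0, 4]]
a1 = [[3, 2], [2, 0]]
a2 = [[0, 0], [0, 1]]

approx = second_order_eigenvalues(a0, a1, a2, eps)
full = [[a0[i][j] + eps * a1[i][j] + eps ** 2 * a2[i][j] for j in range(2)]
        for i in range(2)]
exact = exact_eigenvalues_2x2(full)
print(approx)
print(exact)
```

For this value of ε the two lists differ only by amounts of the order of ε³, as one would expect from a formula that is correct to terms in ε².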

When the unperturbed matrix $\alpha^0$ is degenerate, the formula (29) acquires vanishing denominators, indicating that our procedure then needs modification. A closer study, which we shall not undertake here, shows that, in the degenerate case, $S^0$ can and must be chosen to be nondiagonal, and that new formulas take the place of (28) and (29). In particular, if the unperturbed matrix is degenerate it is no longer possible to find the eigenvalues of the perturbed matrix correctly to terms in ε by merely inspecting its diagonal elements.

Formulas for the eigenvalues correct to any power of ε whenever the process is convergent are given, for example, by Condon and Shortley, page 34.

In their detailed presentation of the matrix perturbation theory, Condon and Shortley consider as an illustration the particular matrix

$$\begin{pmatrix} 10 + 6\epsilon & 0 & 10\epsilon \\ 0 & 10 & 30\epsilon \\ 10\epsilon & 30\epsilon & 0 \end{pmatrix} \tag{30}$$

and plot its precise eigenvalues as functions of ε in a graph reproduced* in part in Fig. 99. The unperturbed matrix in (30) is degenerate, its eigenvalues being 10, 10, and 0; but for ε ≠ 0 the matrix has three distinct eigenvalues, and we say that the perturbation has removed the degeneracy.

Exercises 

1. Use (29) to show that the eigenvalues of (3), correct to terms in ε², are $1 + 3\epsilon - \tfrac{4}{3}\epsilon^2$ and $4 + \tfrac{7}{3}\epsilon^2$.


* By permission of The Cambridge University Press and The Macmillan 
Company, publishers. 



Fig. 99. The eigenvalues of (30), plotted 



2. Compute the precise eigenvalues of (3), using the secular equation; then allow ε to be small, substitute a power series for the radical in the precise expression for the eigenvalues, and verify the result of Exercise 1.

3. Use (29) to compute correctly to terms in ε² the eigenvalues of the matrix

$$\begin{pmatrix} 1 & \epsilon & 4\epsilon \\ \epsilon & 2 & \epsilon \\ 4\epsilon & \epsilon & 3 \end{pmatrix}.$$

4. Show, using the secular equation, that the eigenvalues of the matrix $\begin{pmatrix} 1 & \epsilon \\ \epsilon & 1 \end{pmatrix}$ are 1 + ε and 1 − ε. Why is (29) not applicable to this case, even when ε is small?

5. Show that in view of (26) and (27) the matrix S′ can be taken to be

$$S'_{nm} = \begin{cases} \dfrac{\alpha'_{nm}}{\alpha^0_{nn} - \alpha^0_{mm}} & \text{if } m \neq n \\[3mm] 0 & \text{if } m = n. \end{cases} \tag{31}$$

Use 10®* and 11®* as partial checks.


7’ '"^®“ ’‘®®‘^ yi®i‘i® oi (6)- 

nf ('n ^ is nondegenerate, the similarity transformation 

term S' J ^ (D *<> 

8. Derive (29). 

68. The Matrix Perturbation Method for Nondegenerate States

As we return to physics, we recall that the structure of the Heisenberg matrix associated with a dynamical variable α(x, p) depends on the dynamical system under consideration: for example, as noted in Exercise 6“, the matrix associated with the coordinate x is one matrix when the dynamical system in question is the oscillator, and quite another matrix when the system is a particle in a box. We also recall (3«) that the Heisenberg matrix H associated with the Hamiltonian of a system is diagonal and has the form

$$H_{nm} = \begin{cases} E_n & \text{if } m = n \\ 0 & \text{if } m \neq n, \end{cases} \tag{1}$$

where the E's are the energy levels of the system, so that the respective Heisenberg energy matrices pertaining to two different systems are both diagonal and differ from each other only through the numerical values of their diagonal elements (recall Exercise ^).


Let the Hamiltonian function of some specific dynamical system be

$$H^0 = \frac{p^2}{2m} + V^0(x), \tag{2}$$

and let the Heisenberg matrices (which we denote by $x^0$, $p^0$, $H^0$, and so on) associated, respectively, with the coordinate, the momentum, the Hamiltonian, and so on, of this system be known. The availability of these matrices is then usually of no assistance in the computation of the matrices x, p, and H, associated, respectively, with the coordinate, the momentum, and the Hamiltonian of another specific dynamical system whose Hamiltonian function is

$$H = \frac{p^2}{2m} + V(x); \tag{3}$$

for example, the availability of the various Heisenberg matrices pertaining to the linear harmonic oscillator is of no assistance in computing the various matrices pertaining to a particle in a box. An exceptional case arises, however, if (1^®)

$$V(x) = V^0(x) + V'(x), \tag{4}$$

where the term V′(x) is small, that is, if the second system differs from the first only through a small perturbing potential; in this case the knowledge of the matrices pertaining to the first (the unperturbed) system is very useful in the treatment, or at least the approximate treatment, of the second (the perturbed) system.

We now proceed to show how the Heisenberg matrices pertaining to the unperturbed system can be used for an approximate computation of the energy levels of the perturbed system. Our first steps are to note that the Hamiltonian functions (2) and (3) satisfy the relation

$$H = H^0 + V'(x) \tag{5}$$

and to set up the auxiliary matrix

$$\mathcal{H} = H^0 + V'(x^0), \tag{6}$$

where $H^0$ is the energy matrix of the unperturbed system, $x^0$ is the coordinate matrix of the unperturbed system, and the matrix $V'(x^0)$ is the same function of the matrix $x^0$ as the perturbing potential V′(x) is of the coordinate x.



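To illustrate: Let the unperturbed system be a linear harmonic oscillator and let the various Heisenberg matrices which pertain to it and which we distinguish by the superscript zero be available; in particular, the matrices lO** and 18“, that is,

$$x^0 = a\begin{pmatrix} 0 & \sqrt{\tfrac{1}{2}} & 0 & \cdots \\ \sqrt{\tfrac{1}{2}} & 0 & \sqrt{\tfrac{2}{2}} & \cdots \\ 0 & \sqrt{\tfrac{2}{2}} & 0 & \cdots \\ \vdots & \vdots & \vdots & \end{pmatrix}, \qquad H^0 = \begin{pmatrix} \tfrac{1}{2}h\nu_c & 0 & 0 & \cdots \\ 0 & \tfrac{3}{2}h\nu_c & 0 & \cdots \\ 0 & 0 & \tfrac{5}{2}h\nu_c & \cdots \\ \vdots & \vdots & \vdots & \end{pmatrix} \tag{7a, b}$$

are then available. Further, let the perturbing potential have the special form V′(x) = εkx. Then

$$V'(x^0) = \epsilon k x^0 = \epsilon k a\begin{pmatrix} 0 & \sqrt{\tfrac{1}{2}} & 0 & \cdots \\ \sqrt{\tfrac{1}{2}} & 0 & \sqrt{\tfrac{2}{2}} & \cdots \\ 0 & \sqrt{\tfrac{2}{2}} & 0 & \cdots \\ \vdots & \vdots & \vdots & \end{pmatrix}, \tag{8}$$

so that the auxiliary matrix $\mathcal{H}$ is

$$\mathcal{H} = \begin{pmatrix} \tfrac{1}{2}h\nu_c & \epsilon k a\sqrt{\tfrac{1}{2}} & 0 & \cdots \\ \epsilon k a\sqrt{\tfrac{1}{2}} & \tfrac{3}{2}h\nu_c & \epsilon k a\sqrt{\tfrac{2}{2}} & \cdots \\ 0 & \epsilon k a\sqrt{\tfrac{2}{2}} & \tfrac{5}{2}h\nu_c & \cdots \\ \vdots & \vdots & \vdots & \end{pmatrix}. \tag{9}$$

The energy matrix H of the perturbed system is not available, although we know that it is diagonal. Our purpose is to show how H can be approximated with the help of the matrix $\mathcal{H}$.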


To establish the property of $\mathcal{H}$ which is responsible for our interest in this matrix, we proceed as follows: According to the fundamental formula 2®^, the elements of the matrix H (the energy matrix of the perturbed system) are

$$H_{nm} = \int \bar{\psi}_n H \psi_m \, dx, \tag{10}$$

where H in the integrand is the Schroedinger operator associated with the Hamiltonian of the perturbed system and the ψ's are normalized eigenfunctions* of this operator; in view of (5), Equation (10) can be written as

$$H_{nm} = \int \bar{\psi}_n \left[H^0 + V'(x)\right] \psi_m \, dx, \tag{11}$$

where $H^0$ is the Schroedinger operator associated with the Hamiltonian of the unperturbed system. Similarly, the elements of the matrix $H^0$ (the energy matrix of the unperturbed system) are

$$H^0_{nm} = \int \bar{\psi}^0_n H^0 \psi^0_m \, dx, \tag{12}$$

where $H^0$ in the integrand is the Schroedinger operator associated with the Hamiltonian of the unperturbed system, and the $\psi^0$'s are normalized eigenfunctions* of this operator. Again, the nmth element of the matrix $x^0$ (the coordinate matrix of the unperturbed system) is $\int \bar{\psi}^0_n x \psi^0_m \, dx$, and the nmth element of the matrix $V'(x^0)$ is†

$$\left[V'(x^0)\right]_{nm} = \int \bar{\psi}^0_n V'(x) \psi^0_m \, dx. \tag{13}$$

In view of (6), (12), and (13), we thus have the result

$$\mathcal{H}_{nm} = \int \bar{\psi}^0_n \left[H^0 + V'(x)\right] \psi^0_m \, dx. \tag{14}$$

* Insofar as our argument is concerned, it is quite immaterial whether the explicit forms of these eigenfunctions are available or not.

† In practice we would perhaps prefer to compute the matrix $V'(x^0)$ by matrix algebra in terms of the matrix $x^0$, rather than by an extra integration; but for our present purposes it is important to note that the nmth element of $V'(x^0)$ can be expressed in the form (13).

Now, according to (11), the matrix H is the representative of the Schroedinger operator $H^0 + V'(x)$ when the basic ψ's are the Schroedinger eigenfunctions of the perturbed Hamiltonian. Similarly, according to (14), the matrix $\mathcal{H}$ is the representative of the same Schroedinger operator when the basic ψ's are the Schroedinger eigenfunctions of the unperturbed Hamiltonian. Thus the matrices H and $\mathcal{H}$ represent the same differential operator, though with respect to different sets of basic ψ's; and, since the eigenvalues of any matrix representing a differential operator are the same as those of the differential operator itself, we conclude that the eigenvalues of the matrix $\mathcal{H}$ are the same as those of the matrix H. In other words, the energy levels of the perturbed system are the eigenvalues of the auxiliary matrix $\mathcal{H}$. Since we have not yet used the assumption that V′(x) is small, this conclusion is quite general.

Perturbation formulas. We now turn to the case when the energy spectrum $E^0_1, E^0_2, \cdots$ of the unperturbed system is nondegenerate and when the perturbing potential can be expanded into a power series in a small parameter ε, so that (12“)

$$V'(x) = \epsilon v'(x) + \epsilon^2 v''(x) + \cdots, \tag{15}$$

where the v's are functions of x independent of ε. The auxiliary matrix $\mathcal{H}$ can then be written as

$$\mathcal{H} = H^0 + \epsilon v'(x^0) + \epsilon^2 v''(x^0) + \cdots. \tag{16}$$

The matrix $\mathcal{H}$ thus has the form 1⁶⁷, so that its eigenvalues (which, as we have seen, are the same as those of H) can be approximated by means of 29⁶⁷. The correlation of the symbols of the present section with those of §67 is as follows: $\alpha = \mathcal{H}$, $\alpha^0 = H^0$, $\alpha' = v'(x^0)$, $\alpha'' = v''(x^0)$, and so on; $\lambda_n$ in §67 denotes the nth eigenvalue of α (that is, of $\mathcal{H}$), and since the eigenvalues of $\mathcal{H}$ are the energy levels of the perturbed system we write $\lambda_n = E_n$; further, since the diagonal elements of $H^0$ are the energy levels of the unperturbed system, we write $\alpha^0_{nn} = E^0_n$. When we substitute the new symbols into 29⁶⁷ and, to simplify appearances, suppress the argument $x^0$ of the v's, we get the formula

$$E_n = E^0_n + \epsilon v'_{nn} + \epsilon^2\left[v''_{nn} + {\sum_i}' \frac{|v'_{ni}|^2}{E^0_n - E^0_i}\right]. \qquad [v = v(x^0)] \tag{17}$$

The supplementary equation $v = v(x^0)$ in (17) is a reminder that $v'_{nn}$ stands for the nnth element of the matrix $v'(x^0)$, that $v''_{nn}$ stands for the nnth element of the matrix $v''(x^0)$, and so on; as before, the prime in the Σ′ means that the term i = n is to be omitted in the summation.

Equation (17) is the perturbation formula of matrix mechanics; it gives the energy levels of the perturbed system correctly to terms in ε², provided the unperturbed system is nondegenerate.

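To illustrate, we apply (17) to the oscillator perturbed by the potential εkx. The matrices involved in (17) therefore are [remember that we write v for $v(x^0)$] $v' = kx^0$ and $v'' = 0$. The matrix $x^0$, shown in (7), has the elements 9®*, so that

$$v'_{nm} = \begin{cases} ka\sqrt{\tfrac{1}{2}n} & \text{if } m = n - 1 \\ ka\sqrt{\tfrac{1}{2}(n+1)} & \text{if } m = n + 1 \\ 0 & \text{otherwise,} \end{cases} \tag{18}$$

while $v''_{nm} = 0$ for all values of n and m. In particular, it follows from (18) that

$$|v'_{ni}|^2 = \begin{cases} \tfrac{1}{2}nk^2a^2 & \text{if } i = n - 1 \\ \tfrac{1}{2}(n+1)k^2a^2 & \text{if } i = n + 1 \\ 0 & \text{otherwise.} \end{cases} \tag{19}$$

Substituting these results into (17) we get

$$E_n = E^0_n + \epsilon^2\left[\frac{\tfrac{1}{2}nk^2a^2}{E^0_n - E^0_{n-1}} + \frac{\tfrac{1}{2}(n+1)k^2a^2}{E^0_n - E^0_{n+1}}\right], \tag{20}$$

which, in view of the relations 5“, reduces to

$$E_n = E^0_n - \frac{\epsilon^2k^2a^2}{2h\nu_c}. \tag{21}$$

On the basis of the present computation we may expect that the precise expressions for $E_n$ might involve terms in ε³, ε⁴, and so on; but in fact the result (21) is exact, as shown in Exercise 2®®.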
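The level-independence of the second-order shift in this illustration can be verified with exact arithmetic. The sketch below works in assumed units in which the oscillator length a and the level spacing are both unity, so that the unperturbed levels are n + 1/2 with n = 0, 1, 2, ···, and the squared matrix elements are n·k²/2 and (n + 1)·k²/2 for the two neighbours of the level n; the function names and the value of k are illustrative.

```python
from fractions import Fraction

def v1_squared(n, i, k):
    """Squared element of k*x0 between levels n and i, in units where a = 1."""
    if i == n - 1:
        return Fraction(n, 2) * k ** 2
    if i == n + 1:
        return Fraction(n + 1, 2) * k ** 2
    return Fraction(0)

def unperturbed_level(n):
    """Oscillator level n + 1/2, in units of the level spacing."""
    return Fraction(2 * n + 1, 2)

def second_order_coefficient(n, k, n_max=50):
    """Primed sum of the perturbation formula for the level n."""
    return sum(v1_squared(n, i, k) / (unperturbed_level(n) - unperturbed_level(i))
               for i in range(n_max) if i != n)

k = Fraction(3, 10)
coefficients = [second_order_coefficient(n, k) for n in range(6)]
print(coefficients)
```

Every level comes out with the same coefficient, −k²/2 in these units, so the perturbation lowers all levels by the same amount and leaves the spacing unchanged.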


By extending the developments of §67 and applying the results to our physical perturbation problem, we may evaluate one by one the terms of $E_n$ involving ε³, ε⁴, and so on; but in practice it is seldom of interest to go beyond the second-order correction included in (17). Note that the first-order correction given by (17) is $\epsilon v'_{nn}$, so that this correction vanishes whenever the diagonal elements of $V'(x^0)$ are free from first powers of ε; but remember that (17) was derived for the case when $H^0$ is nondegenerate. We shall leave it to the reader (Exercise 1) to show that the perturbation formula (17) is precisely equivalent to the Schroedinger perturbation formulas derived in §36.


Exercises 

1. Identify the integrals in 22³⁶ and 31³⁶ with Heisenberg matrix elements, and show that these formulas, when taken together, are equivalent to (17).

2. Assume the matrices (7) to be available, and approximate by means 
of (17) the energy levels of an oscillator perturbed by the potential 

Use 10*® to check the result for the normal state of the perturbed system. 



3. Assume the matrices (7) to be available, and approximate by means of (17) the energy levels of an oscillator perturbed by the potential εx*; check the result by a reference to Exercise 4*®.

4. The matrix each of whose elements is the time average of the corresponding element of a given matrix is said to be the time average of the given matrix. Show that, if the unperturbed system is nondegenerate, then the time average of the auxiliary matrix $\mathcal{H}$ [that is, the matrix (6)] is a diagonal matrix whose diagonal elements are the energy levels of the perturbed system correct to terms in ε.


69. More Matrix Algebra 


The first and the second subscripts in the symbol $\alpha_{kl}$ for a matrix element denote, respectively, the row and the column of the matrix α in which this element belongs. The formulas to be discussed in this section take on a somewhat more readily remembered form if we adopt a similar two-subscript notation for the elements of column symbols, and denote the kth component of the column symbol u by $u_{k\cdot}$ rather than by $u_k$ as we did before; the first subscript (the subscript k) then tells us the row of the column symbol u in which the element $u_{k\cdot}$ belongs, while the second (the subscript ·) reminds us that the symbol u has only one column. The new notation is illustrated in Equations (7), (10), and (13); we note, in particular, the similarity between (7) and (6), between (10) and (9), and between (13) and (12).

Now, if we replace each element of a matrix $\alpha$ by its complex
conjugate and transpose the matrix so obtained, then the resulting
matrix, introduced in 19^, is called the complex conjugate of the
matrix $\alpha$ and is denoted by $\bar{\alpha}$. If we treat a column symbol $u$
in a similar manner the result is a row symbol; this symbol is
called the complex conjugate of the column symbol $u$ and is denoted
by $\bar{u}$. Thus the complex conjugate of the column symbol
$u = \begin{pmatrix} 1 \\ i \end{pmatrix}$ is the row symbol $\bar{u} = (1, -i)$, the complex
conjugate of $v = \begin{pmatrix} -i \\ 2 + i \end{pmatrix}$ is $\bar{v} = (i, 2 - i)$, and so forth.
The elements of a row symbol $\bar{u}$ are labeled in a similar way:
the $k$th element will be denoted by $\bar{u}_{\cdot k}$, so that, by definition,

    $\bar{u}_{\cdot k} = \overline{u_{k\cdot}}$,    (1)





where $\overline{u_{k\cdot}}$ is the complex conjugate of the number $u_{k\cdot}$. Note
that the two-subscript notation for the elements of $\bar{u}$ fits in with
that for the elements of a matrix and of a column symbol: the
first subscript (the subscript $\cdot$) in $\bar{u}_{\cdot k}$ reminds us that the symbol
in question has only one row, while the second (the subscript $k$)
tells the column of $\bar{u}$ in which this element belongs.* Whenever
we speak of a column symbol $u$ as the vector $u$, we speak of the
row symbol $\bar{u}$ as the complex conjugate of the vector $u$. The matrices
of this section will each have $N$ rows and $N$ columns, while the
column symbols and row symbols will each have $N$ elements.

If the corresponding elements of two row symbols $\bar{v}$ and $\bar{u}$ are
equal to each other, that is, if

    $\bar{u}_{\cdot k} = \bar{v}_{\cdot k}$,    $k = 1, 2, \ldots, N$,    (2)

we write

    $\bar{u} = \bar{v}$    (3)

and say that the row symbols $\bar{v}$ and $\bar{u}$ are equal to each other.
The symbolic equation (3) and its interpretation (2) are recorded
in (8); note the similarity between (8), (7), and (6).

The rules for multiplying a row symbol by a constant and for
adding two or more row symbols are patterned after the corresponding
rules for matrices and column symbols. Thus, if two
row symbols $\bar{u}$ and $\bar{v}$ are so related that

    $\bar{v}_{\cdot k} = c\,\bar{u}_{\cdot k}$,    $k = 1, 2, \ldots, N$,    (4)

where $c$ is a constant, then we write

    $\bar{v} = c\bar{u}$    (5)

and say that $\bar{v}$ equals $c$ times $\bar{u}$. The symbolic equation (5) and
its interpretation (4) are recorded in (14); note the similarity
between (14), (13), and (12). Again, if three row symbols $\bar{u}$, $\bar{v}$,
and $\bar{w}$ are so related that $\bar{w}_{\cdot k} = \bar{u}_{\cdot k} + \bar{v}_{\cdot k}$ for every value of $k$,
then we write $\bar{w} = \bar{u} + \bar{v}$ and say that $\bar{w}$ is the sum of $\bar{u}$ and $\bar{v}$;
this definition is recorded in (11).

Our rules for adding two symbols require the addition of the 
similarly placed elements of the addends, and are thus unsuited 
for the case when the numbers of rows and columns of one symbol 

* Note that the $k$th element of $\bar{u}$ can be denoted without ambiguity
by $\bar{u}_{\cdot k}$ rather than by $(\bar{u})_{\cdot k}$.





do not match those of the other; accordingly, our algebra does not 
provide for the addition of a matrix and a vector, a matrix and 
the complex conjugate of a vector, or a vector and the complex 
conjugate of a vector. 


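Earlier    Symbolic                      Interpretation                                                         Number of
formula    equation                                                                                             equations

(23“)      $\alpha = \beta$              $\alpha_{kl} = \beta_{kl}$                                             $N^2$    (6)
(28M)      $u = v$                       $u_{k\cdot} = v_{k\cdot}$                                              $N$      (7)
           $\bar{u} = \bar{v}$           $\bar{u}_{\cdot k} = \bar{v}_{\cdot k}$                                $N$      (8)
(24«)      $\gamma = \alpha + \beta$     $\gamma_{kl} = \alpha_{kl} + \beta_{kl}$                               $N^2$    (9)
(29**)     $w = u + v$                   $w_{k\cdot} = u_{k\cdot} + v_{k\cdot}$                                 $N$      (10)
           $\bar{w} = \bar{u} + \bar{v}$ $\bar{w}_{\cdot l} = \bar{u}_{\cdot l} + \bar{v}_{\cdot l}$            $N$      (11)
(26“)      $\beta = c\alpha$             $\beta_{kl} = c\,\alpha_{kl}$                                          $N^2$    (12)
(30“)      $v = cu$                      $v_{k\cdot} = c\,u_{k\cdot}$                                           $N$      (13)
           $\bar{v} = c\bar{u}$          $\bar{v}_{\cdot l} = c\,\bar{u}_{\cdot l}$                             $N$      (14)
           $\gamma = \alpha\beta$        $\gamma_{kl} = \sum_i \alpha_{ki}\beta_{il}$                           $N^2$    (15)
(27“)      $\delta = \beta\alpha$        $\delta_{kl} = \sum_i \beta_{ki}\alpha_{il}$                           $N^2$    (16)
(31«)      $v = \alpha u$                $v_{k\cdot} = \sum_i \alpha_{ki}u_{i\cdot}$                            $N$      (17)
           $\bar{v} = \bar{u}\alpha$     $\bar{v}_{\cdot l} = \sum_i \bar{u}_{\cdot i}\alpha_{il}$              $N$      (18)
           $c = \bar{u}v$                $c_{\cdot\cdot} = \sum_i \bar{u}_{\cdot i}v_{i\cdot}$                  $1$      (19)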


Our old rules for multiplying two symbols, neither of which is
simply a number, imply that the left factor has as many columns
as the right factor has rows, and hence allow for the multiplication
of two matrices and for the multiplication of a vector from
the left by a matrix; but they do not allow for the multiplication
of a matrix from the left by a vector, so that the symbol $u\alpha$ is
to be regarded as meaningless. When these rules are extended
to include row symbols, we find that the symbol $\alpha\bar{u}$ has no meaning,
but that the symbol $\bar{u}\alpha$ can be defined as the row symbol
whose $k$th element $(\bar{u}\alpha)_{\cdot k}$ equals $\sum_i \bar{u}_{\cdot i}\alpha_{ik}$; this definition is
recorded in (18). Again, the symbol $v\bar{u}$ (as well as the symbol $uv$)
has no meaning, while, as recorded in (19), the symbol $\bar{u}v$ can





be defined as the number $\sum_i \bar{u}_{\cdot i}v_{i\cdot}$; the two dot subscripts of $c_{\cdot\cdot}$
in (19), though they are not altogether necessary, remind us that
the quantity $\bar{u}v$ is a numerical constant (having only one row and
only one column), and make (19) quite similar in appearance to
the four equations preceding it. Examples of products involving
row symbols are given in Exercise 1.

Since the product $\bar{u}\alpha$ is a row symbol, the product $(\bar{u}\alpha)v$ is a
number; this number turns out to be the same as $\bar{u}(\alpha v)$, that is,
the product of $\bar{u}$ and the column symbol $\alpha v$; therefore we can use
without ambiguity the symbol $\bar{u}\alpha v$:

    $\bar{u}\alpha v = (\bar{u}\alpha)v = \bar{u}(\alpha v)$.    (20)

We shall leave it to the reader to show that

    $\bar{u}\alpha v = \sum_i \sum_j \bar{u}_{\cdot i}\,\alpha_{ij}\,v_{j\cdot}$,    $i, j = 1, 2, \ldots, N$.    (21)

According to (19), $\bar{u}u = \sum_i \bar{u}_{\cdot i}u_{i\cdot} = \sum_i |u_{i\cdot}|^2$, so that, recalling the
interpretation of 13^®, we get

    $\bar{u}u$ = square of length of $u$.    (22)

Similarly, the condition 14®® for the orthogonality of two vectors
$u$ and $v$ can be written as

    $\bar{u}v = 0$    (23)

or as $\bar{v}u = 0$.

We shall leave it to the reader to show that the complex
conjugate of a product of our symbols is the product of the complex
conjugates of the factors taken in the reverse order, except that the
position of a numerical factor is of course immaterial. For
example,

    $\overline{cu} = \bar{c}\bar{u}$,    $\overline{\alpha u} = \bar{u}\bar{\alpha}$,    $\overline{\bar{u}\alpha} = \bar{\alpha}u$,    $\overline{\bar{u}v} = \bar{v}u$    (24)

and

    $\overline{\bar{u}\alpha v} = \bar{v}\bar{\alpha}u$.    (25)

These results are an extension of 24c®®.
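
The product rules (17) to (19), the associativity (20), and the
conjugation rule (25) can be checked numerically on a small example.
The following Python sketch is only an illustration under stated
assumptions: column symbols and row symbols are both stored as plain
lists, the helper names (conj_vec, conj_mat, and so on) are hypothetical,
and the numbers are arbitrary rather than taken from the text.

```python
def conj_vec(u):
    """Row symbol u-bar conjugate to the column symbol u."""
    return [z.conjugate() for z in u]

def conj_mat(a):
    """Complex conjugate of a matrix in the sense used here:
    conjugate every element, then transpose."""
    n = len(a)
    return [[a[l][k].conjugate() for l in range(n)] for k in range(n)]

def row_times_mat(r, a):
    """(r a)_{.l} = sum_i r_{.i} a_{il}, cf. (18)."""
    n = len(a)
    return [sum(r[i] * a[i][l] for i in range(n)) for l in range(n)]

def mat_times_col(a, v):
    """(a v)_{k.} = sum_i a_{ki} v_{i.}, cf. (17)."""
    n = len(a)
    return [sum(a[k][i] * v[i] for i in range(n)) for k in range(n)]

def row_times_col(r, v):
    """r v = sum_i r_{.i} v_{i.}, a number, cf. (19)."""
    return sum(x * y for x, y in zip(r, v))

# Arbitrary illustrative data (an assumption of this sketch).
u = [1 + 0j, 1j]
v = [-1j, 2 + 1j]
alpha = [[1 + 2j, 3 + 0j], [-1j, 4 - 1j]]

ubar, vbar = conj_vec(u), conj_vec(v)

left = row_times_col(row_times_mat(ubar, alpha), v)     # (u-bar alpha) v
right = row_times_col(ubar, mat_times_col(alpha, v))    # u-bar (alpha v)
reversed_product = row_times_col(row_times_mat(vbar, conj_mat(alpha)), u)

print(left == right)                          # associativity (20)
print(left.conjugate() == reversed_product)   # conjugation rule (25)
print(row_times_col(ubar, u))                 # square of the length of u, (22)
```

Because the entries are small Gaussian integers, the comparisons are
exact; with general floating-point data a tolerance would be needed.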

Hermitian matrices. The following proofs of the two theorems
taken for granted in §58 illustrate the convenience of our symbolism.

Every eigenvalue of a Hermitian matrix is real. Let $\alpha$ be a
Hermitian matrix, so that

(22®®)    $\alpha = \bar{\alpha}$,    (26)





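and let $\lambda$ be the eigenvalue of $\alpha$ belonging to the eigenvector
$u$ of $\alpha$, so that

    $\alpha u = \lambda u$.    (27)

Multiplying (27) from the left by $\bar{u}$ we get

    $\bar{u}\alpha u = \lambda\bar{u}u$.    (28)

Taking the complex conjugates of the two sides of (28), we get
$\overline{\bar{u}\alpha u} = \overline{\lambda\bar{u}u}$, that is, $\bar{u}\bar{\alpha}u = \bar{\lambda}\bar{u}u$; and since $\alpha$ satisfies (26) this
becomes

    $\bar{u}\alpha u = \bar{\lambda}\bar{u}u$.    (29)

It now follows from (28) and (29) that $\lambda = \bar{\lambda}$, and the theorem
is proved.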

The eigenvectors of a Hermitian matrix belonging to distinct
eigenvalues are orthogonal. Let $\alpha$ be Hermitian and let $u$ and $v$
be eigenvectors of $\alpha$ belonging, respectively, to the eigenvalues $\lambda$
and $\mu$, so that

    $\alpha u = \lambda u$,    $\alpha v = \mu v$,    (30a, b)

where, according to the theorem just proved, both $\lambda$ and $\mu$ are
real. Multiplying (30a) from the left by $\bar{v}$ and (30b) from the
left by $\bar{u}$, we get

    $\bar{v}\alpha u = \lambda\bar{v}u$,    $\bar{u}\alpha v = \mu\bar{u}v$.    (31a, b)

Taking the complex conjugates of both sides of (31b), we get
$\overline{\bar{u}\alpha v} = \overline{\mu\bar{u}v}$, that is, $\bar{v}\bar{\alpha}u = \bar{\mu}\bar{v}u$, which reduces to

    $\bar{v}\alpha u = \mu\bar{v}u$.    (32)

Subtracting (32) from (31a), we finally get

    $(\lambda - \mu)\bar{v}u = 0$,    (33)

so that $\bar{v}u = 0$ whenever $\lambda \neq \mu$, and the theorem is proved.
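
Both theorems can be illustrated on a 2-by-2 Hermitian matrix. The
sketch below is an illustration only: the matrix entries are arbitrary
choices, the eigenvalues are taken from the characteristic equation
of a 2-by-2 matrix, and the helper names apply and inner are
hypothetical.

```python
import math

# A 2x2 Hermitian matrix [[a, b], [conj(b), d]] with real a and d
# (arbitrary illustrative values).
a, d = 2.0, -1.0
b = 2j
alpha = [[a, b], [b.conjugate(), d]]

# Roots of lambda^2 - (a + d) lambda + (a d - |b|^2) = 0.
# The discriminant (a - d)^2 + 4 |b|^2 is never negative,
# so both eigenvalues are real.
disc = math.sqrt((a - d) ** 2 + 4 * abs(b) ** 2)
lam = (a + d + disc) / 2
mu = (a + d - disc) / 2

# For b != 0, an eigenvector belonging to the eigenvalue x is (b, x - a).
u = [b, lam - a]
v = [b, mu - a]

def apply(m, w):
    """Column symbol m w for a 2x2 matrix m."""
    return [m[0][0] * w[0] + m[0][1] * w[1],
            m[1][0] * w[0] + m[1][1] * w[1]]

def inner(w, z):
    """The number w-bar z of (19)."""
    return w[0].conjugate() * z[0] + w[1].conjugate() * z[1]

print(lam, mu)                                  # the two real eigenvalues
print(apply(alpha, u), [lam * x for x in u])    # alpha u and lambda u
print(apply(alpha, v), [mu * x for x in v])     # alpha v and mu v
print(inner(v, u))                              # v-bar u, cf. (33)
```

The values were chosen so that the square root is exact; for other
entries the last number vanishes only to within rounding error.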

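Geometric interpretation. Just as $N$-component column
symbols $u$, $v$, and so on may be interpreted as representing
points in an $N$-space ($u$ representing the point whose coordinates
are $u_{1\cdot}$, $u_{2\cdot}$, $\ldots$, $u_{N\cdot}$; and so on), so $N$-component
row symbols $\bar{u}$, $\bar{v}$, and so on may be interpreted as representing
points in another $N$-space ($\bar{u}$ representing the point whose
coordinates are $\bar{u}_{\cdot 1}$, $\bar{u}_{\cdot 2}$, $\ldots$, $\bar{u}_{\cdot N}$; and so on). The
second $N$-space is then called the dual of the first.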





It should perhaps be emphasized that it is improper to interpret
column symbols and row symbols as representing points within
the same $N$-space, since such an interpretation would lead to
inconsistencies;* for example, this interpretation would suggest
that a column symbol and a row symbol can be added together,
while the rules of our algebra do not provide for such an addition.
When a mapping $\alpha$ is carried out in the space containing $u$, $v$,
$w$, and so on, a corresponding mapping takes place in the dual
space, since the row symbols $\bar{u}$, $\bar{v}$, $\bar{w}$, and so on, are sent,
respectively, into $\bar{u}\bar{\alpha}$, $\bar{v}\bar{\alpha}$, $\bar{w}\bar{\alpha}$, and so on; but we need not discuss these
mappings here.


Exercises 


1. Show that if u 





then 


iJcK = (1 — 42, 10 — 7i, 14), wa — 00 ! = (1, 2, 3), Hv — 5, 0u * 6, uu « 6, 
0y = 5, uav = 20, vau « 20, dau = 16 — 7i, uav * 16 7i, Uw ^ 1 if 

WM * 1 — if Uw =* 0, and wv =■ 0. 

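2. Show that when $u$, $v$, $\alpha$, and $\beta$ are arbitrary we have
$\bar{u}\alpha + \bar{v}\alpha = (\bar{u} + \bar{v})\alpha$, $\alpha u + \alpha v = \alpha(u + v)$,
$\bar{u}\alpha + \bar{u}\beta = \bar{u}(\alpha + \beta)$, $\alpha u + \beta u = (\alpha + \beta)u$,
and $\bar{u}\alpha v + \bar{u}\beta v = \bar{u}(\alpha + \beta)v$.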

3. Prove (20) and (21). 

4. Prove (24) and (25). 

5. If $u$ and $v$ are vectors and $\alpha$ and $\beta$ are matrices, what is $\bar{u}\alpha\beta v$? Write
down the complex conjugate of $\bar{u}\alpha\beta v$ in terms of $u$, $\bar{v}$, $\bar{\alpha}$, and $\bar{\beta}$, and verify
your result by a simple nontrivial numerical example.

6. Show that in order for the matrix $\alpha$ to be Hermitian it is necessary
and sufficient that the equation $\bar{u}\alpha v = \overline{\bar{v}\alpha u}$ hold when $u$ and $v$ are arbitrary.
Writing this as $\bar{u}\alpha v = \bar{u}\bar{\alpha}v$ and then as $\bar{u}\alpha v = \overline{\alpha u}\,v$, we get the matrix
equivalent of 19<*.

7. Construct straightforward algebraic proofs (not involving symbolic 
manipulation) of the two theorems concerning the reality of the eigen- 
values and the orthogonality of the eigenvectors of Hermitian matrices, 
proved in the text by symbolic methods. 


* $N$-component column symbols and row symbols can, however, be
interpreted without inconsistencies in terms of a single $N$-space by
associating the column symbols with points and the row symbols with
$(N - 1)$-dimensional elements of the space. Thus when 2-component column
symbols are interpreted as representing points in a plane, then 2-component
row symbols can be interpreted as representing straight lines in the
same plane; when 3-component column symbols are interpreted as
representing points in a 3-space, then 3-component row symbols can be
interpreted as representing planes in the same 3-space; and so forth.





70. The Heisenberg Representation of States 

The Heisenberg method is particularly well adapted to the 
search for the possible values of dynamical variables, since it 
allows this search to be conducted without explicit reference to 
the representatives of the states of the system, that is, to the 
operands of the Heisenberg matrices; thus in computing the 
energy levels of the oscillator in §65 we dealt only with the Heisen- 
berg operators (matrices) associated with dynamical variables, and 
had no need to consider the operands of these operators. In the 
Schroedinger method, on the other hand, the study of the behavior 
of the operands is essential for the identification of the eigenvalues 
of the operator associated with a dynamical variable. The 
Heisenberg operands, however, come into play in studies, carried 
out by the Heisenberg method, of detailed properties of states; 
for example, in problems concerning the distribution-in-$x$ for a 
given state. But, as we shall see, the Heisenberg method is not 
particularly well adapted to problems of this kind. 

According to the operand rule IV of §17, states of dynamical
systems are to be associated with the operands of the quantum-mechanical
operators. The Heisenberg operators are matrices;
accordingly, their operands, which we shall denote by $\psi$'s, are
vectors, that is, column symbols. Now, according to the definition 2,
the Heisenberg matrix $\alpha$ associated with the dynamical
variable $\alpha$ pertaining to a dynamical system with a Hamiltonian
$H$ is the matrix representing the Schroedinger operator $\alpha$ when
the basic $f$'s are the normalized time-dependent Schroedinger
eigenfunctions of $H$; similarly, we adopt the definition: the Heisenberg
vector $\psi$ associated with a state $\psi$ of a dynamical system whose
Hamiltonian is $H$ is the vector representing the Schroedinger function
$\psi$ when the basic $f$'s are the normalized time-dependent Schroedinger
eigenfunctions of $H$.

To identify with the help of the Schroedinger theory a Heisenberg
vector $\psi$ associated with the state $\psi$, we then take a Schroedinger
function $\psi$ associated with the state $\psi$, expand it in terms of
the normalized Schroedinger eigenfunctions $\psi_1$, $\psi_2$, $\psi_3$, $\ldots$ of $H$,
getting

    $\psi = c_1\psi_1 + c_2\psi_2 + c_3\psi_3 + \cdots$,    (1)





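and construct the vector

    $\psi = \begin{pmatrix} c_1 \\ c_2 \\ c_3 \\ \vdots \end{pmatrix}$.    (2)

This is a Heisenberg vector representing the state $\psi$; it has, in
general, infinitely many nonvanishing components. In particular,
the Heisenberg vectors associated with the energy states of the
(nondegenerate) system are, apart from an arbitrary multiplicative
constant going with each vector,

    $\psi_1 = \begin{pmatrix} 1 \\ 0 \\ 0 \\ \vdots \end{pmatrix}$,    $\psi_2 = \begin{pmatrix} 0 \\ 1 \\ 0 \\ \vdots \end{pmatrix}$,    $\psi_3 = \begin{pmatrix} 0 \\ 0 \\ 1 \\ \vdots \end{pmatrix}$,    $\ldots$.    (3)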



According to §16, the $c$'s in (1) are numerical constants, and
consequently the Heisenberg $\psi$'s are independent of $t$; in the Heisenberg
method, the time variation of the properties of a system in a
given state is taken care of by the time dependence of the operators 
associated with dynamical variables, this situation being just the 
reverse of that in the Schroedinger method. 

The role of the Schroedinger $\bar{\psi}$'s is taken over in the Heisenberg
method by the row symbols, which are conjugate to the Heisenberg
vectors and which, for a reason to be given later, we denote
by $\phi$'s rather than by $\bar{\psi}$'s. Thus the $\phi$ corresponding to the $\psi$
given by (2) is

    $\phi = (\bar{c}_1, \bar{c}_2, \bar{c}_3, \ldots)$    (4)

and the $\phi$'s corresponding to the stationary states (3) are

    $\phi_1 = (1, 0, 0, 0, \ldots)$
    $\phi_2 = (0, 1, 0, 0, \ldots)$    (5)
    $\phi_3 = (0, 0, 1, 0, \ldots)$

and so forth.

Note that, in view of the rules of the preceding section, a Heisenberg
$\psi$ and a Heisenberg $\phi$ cannot be added to one another. Since a





Schroedinger $\bar{\psi}$ can obviously be added to a Schroedinger $\psi$,
this result may at first sight seem strange. But as our previous
work with the Schroedinger theory illustrates, the necessity of
adding a $\bar{\psi}$ and a $\psi$ never arises in the formalism of the Schroedinger
method. Similarly, the necessity of adding a $\phi$ to a $\psi$
never arises in the Heisenberg theory, and consequently the
impossibility of carrying out such an addition causes no difficulty.

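We shall now identify the Heisenberg equivalent of the Schroedinger
expectation formula

(2^®)    $\mathrm{av}_\psi\,\alpha = \int \bar{\psi}\,\alpha\,\psi\,dx$,    (6)

where $\alpha$ in the integrand is the Schroedinger operator associated
with the dynamical variable $\alpha$, and $\psi$ is a normalized Schroedinger
function describing the state $\psi$. Expanding $\psi$ in the form (1),
we get

    $\mathrm{av}_\psi\,\alpha = \int \Big(\sum_i \bar{c}_i\bar{\psi}_i\Big)\,\alpha\,\Big(\sum_j c_j\psi_j\Big)\,dx
                = \sum_i \sum_j \bar{c}_i \Big[\int \bar{\psi}_i\,\alpha\,\psi_j\,dx\Big] c_j$;    (7)

the bracketed integral is the Heisenberg matrix element $\alpha_{ij}$, so
that (7) becomes

    $\mathrm{av}_\psi\,\alpha = \sum_i \sum_j \bar{c}_i\,\alpha_{ij}\,c_j$.    (8)

The right side of (8) has precisely the same form as that of
21^{69}; recalling that $c_j$ is the $j$th element of the Heisenberg vector
$\psi$ and that $\bar{c}_i$ is the $i$th element of the corresponding Heisenberg $\phi$,
we finally replace (8) by

    $\mathrm{av}_\psi\,\alpha = \phi\alpha\psi$,    (9)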


which is thus the Heisenberg equivalent of (6). If the Schroedinger
$\psi$-function is not normalized, then $\mathrm{av}_\psi\,\alpha$ is given by 2^^
rather than by (6), and (9) becomes replaced by

    $\mathrm{av}_\psi\,\alpha = \dfrac{\phi\alpha\psi}{\phi\psi}$,    (10)

where $\alpha$ in the numerator is the Heisenberg matrix associated
with the dynamical variable $\alpha$, $\psi$ is the Heisenberg vector representing
the state $\psi$, and $\phi$, a row symbol, is the complex conjugate





of the vector $\psi$. Equation (10), the Heisenberg expectation
formula, reduces to (9) whenever $\phi\psi = 1$, that is, whenever the
Heisenberg vector $\psi$ is normalized.
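
The expectation formula (10) translates directly into a short
computation. The sketch below is illustrative only: it assumes a
vector truncated to four components, takes the diagonal energy
matrix of an oscillator with the levels (n + 1/2) measured in units
of h times the classical frequency, and uses a hypothetical helper
named expectation.

```python
def expectation(alpha, psi):
    """Heisenberg expectation formula (10): phi alpha psi / phi psi,
    where phi is the row symbol conjugate to the column symbol psi."""
    n = len(psi)
    phi = [c.conjugate() for c in psi]
    numerator = sum(phi[i] * alpha[i][j] * psi[j]
                    for i in range(n) for j in range(n))
    denominator = sum(phi[i] * psi[i] for i in range(n))
    return numerator / denominator

N = 4  # truncation size (an assumption of this sketch)

# Diagonal energy matrix with entries n + 1/2, n = 0, 1, 2, 3.
H = [[(n + 0.5) if n == m else 0.0 for m in range(N)] for n in range(N)]

energy_state = [0j, 1 + 0j, 0j, 0j]      # a vector of the form (3)
superposition = [1 + 0j, 1j, 0j, 0j]     # not normalized; (10) divides by phi psi

print(expectation(H, energy_state))
print(expectation(H, superposition))
```

Because (10) contains the denominator, the vector need not be
normalized beforehand; multiplying it by any nonzero constant leaves
the result unchanged.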

Note that, as a comparison of (6) and (9) shows, the
Heisenberg expression $\phi\alpha\psi$ is equivalent in the one-dimensional
case to the Schroedinger expression $\int \bar{\psi}\alpha\psi\,dx$ and not to the
Schroedinger expression $\bar{\psi}\alpha\psi$; in particular, both $\phi\alpha\psi$ and $\int \bar{\psi}\alpha\psi\,dx$
are numbers, while $\bar{\psi}\alpha\psi$ is a function of $x$. It is to lessen the
possibility of confusion in this connection that we denote the
conjugates of the Heisenberg $\psi$'s by symbols other than $\bar{\psi}$'s.

Now, how can the Heisenberg $\psi$'s (identified above with the
help of the Schroedinger theory) be constructed without reference
to the Schroedinger $\psi$'s? To do this we interpret (10) as a
fundamental assumption of the Heisenberg method; this assumption
might be labeled V^H, since it plays the same role in the
Heisenberg method that the assumption V^S of §17 does in the
Schroedinger theory. The procedure for constructing the Heisenberg
$\psi$ that specifies a given state of a given system then consists
in identifying the column symbol $\psi$ which, when used in (10)
in conjunction with the Heisenberg matrices pertaining to this
system, yields average values of the various dynamical variables
in agreement with the data that specify the state verbally. Unless
the state in question is an energy state, this procedure is usually
complicated.

When the Heisenberg matrices pertaining to a dynamical
system are available and the Heisenberg $\psi$ specifying a state of
this system is given, then (10) enables us to compute the average
of any dynamical variable for this state. In particular, we may
then compute the averages of all integral powers of a given dynami- 
cal variable and hence infer, at least in principle, the distribution 
function of this variable for this state; but unless the dynamical 
variable in question is a constant of motion, this procedure is 
usually tedious, if not altogether impracticable, when the explicit 
form of the distribution function is desired. 

Exercises 

1. Infer from 16*^ the Heisenberg $\psi$ specifying the state of an oscillator
which in the Schroedinger theory is specified by the $\psi$-function 19®^; then
use (9) to show, with the help of 9®®, that for this state of the oscillator
av a? *= rro cos cot, where co =» 2Trpe . (In §21 we deduced this result from 20®^. ) 





2. Suppose that the Heisenberg $\psi$ of Exercise 1 is available and explain
how the distribution-in-$x$ for the corresponding state can then be found,
at least in principle, without reference to Schroedinger functions.

3. Recall 9*® and explain how the distribution-in-$E$ for a state can be
obtained by inspecting the Heisenberg $\psi$ specifying this state.

4. Recall that the Heisenberg energy matrices are diagonal and use
(10) to show, without reference to the Schroedinger $\psi$'s, that the Heisenberg
$\psi$'s specifying the energy states of a nondegenerate system have the
form (3). Remember that the $n$th energy state is one for which $\mathrm{av}\,E = E_n$
and $\Delta E = 0$.

5. Deduce 4“ from (10).



Chapter XI 


REMARKS ON THE SYMBOLIC METHOD 

We have by now illustrated three quantum-mechanical methods 
of associating specific operators with dynamical variables: the 
Schroedinger method, the p-method, and the Heisenberg method. 
The quantum condition $\alpha\beta - \beta\alpha = i\hbar[\alpha, \beta]$ of §13, however, 
fixes the commutation rules that the quantum-mechanical oper- 
ators must satisfy, but does not specify in detail the individual 
operators themselves; and, as pointed out in §13, this implies 
that detailed specification of the individual operators is not 
necessary for the computation of the theoretical data to be com- 
pared with experiment, and that quantum-mechanical calcula- 
tions may be carried out by the purely symbolic methods of 
operator algebra. In this chapter we make a few remarks con- 
cerning the symbolic method introduced by Dirac and emphasized 
in his book, The Principles of Quantum Mechanics. 

71. Symbolic Operators and Operands 

The symbolic method can be formulated without reference to 
the Schroedinger method, the Heisenberg method, and so forth; 
but to a beginner its rules, which are of necessity abstract, are 
then likely to appear rather strange, and we shall therefore first 
list a few of the more abstract properties of our familiar operators 
and operands. 

We begin with the operands, which we have uniformly denoted
by $\psi$'s and which are functions of $x$ in the Schroedinger method,
functions of $p$ in the $p$-method, and column symbols in the Heisenberg
method. The $\psi$'s used in any one of these three methods have
the following properties:

(a) it has a meaning to say that two $\psi$'s are equal to one another;¹


1 Given two Schroedinger functions, $\psi_1$ and $\psi_2$, we can, of course,
determine whether $\psi_1 = \psi_2$ or $\psi_1 \neq \psi_2$; again, given two Heisenberg column
symbols, $\psi_1$ and $\psi_2$, we can determine whether $\psi_1 = \psi_2$ or $\psi_1 \neq \psi_2$. At
present, however, we are concerned merely with the fact that an equation
of the form $\psi_1 = \psi_2$ has a meaning (one meaning in Schroedinger's case,


(b) any two $\psi$'s can be added together, the result being a $\psi$;

(c) there exists a $\psi$, called zero, which, when added to any other
$\psi$, leaves the latter unaltered.

The Heisenberg $\psi$'s, being column symbols, cannot be multiplied
together; the Schroedinger $\psi$'s can be multiplied together from
the mathematical standpoint, and so can the $\psi$'s of the $p$-scheme,
but the need for such a multiplication never arises in the
quantum-mechanical calculations; thus, quantum mechanically,

(d) the multiplication of two $\psi$'s together is meaningless.

Turning next to the operators, to be denoted for the moment by
$\alpha$'s, we find that the $\alpha$'s used in any one of the three methods have
the following properties:

(e) it has a meaning to say that two $\alpha$'s are equal to one another;

(f) any two $\alpha$'s can be added together, the result being an $\alpha$;

(g) any two $\alpha$'s can be multiplied together in two ways, depending
on the order of the factors; and the two products,
both of which are $\alpha$'s, are usually different, so that the
multiplication of the $\alpha$'s is usually noncommutative;

(h) some $\alpha$'s, called constants or numbers, commute with every
$\alpha$; and

(i) the multiplication of the $\alpha$'s is associative.

Insofar as the interplay of operators and operands is concerned,
we note that within any one of the three methods

(j) the addition of an $\alpha$ and a $\psi$ is meaningless;

(k) any $\psi$ can be multiplied from the left by any $\alpha$, and the
product is a $\psi$;

(l) the multiplication of an $\alpha$ from the left by a $\psi$ is meaningless;
and

(m) the $\alpha$'s are linear operators.

Now, in the symbolic approach, the mathematical form of the
operators associated with dynamical variables is left unspecified;
that is, the $\alpha$'s are not required to be differential operators, or
matrices, or operators of any other particular type. The mathematical
form of the operands associated with states is also left
unspecified; that is, the $\psi$'s are not required to be functions of $x$,
or functions of $p$, or column symbols, or operands of any other


another in Heisenberg's, and still another in the case of the $p$-method), 
and not with the tests necessary to verify such an equation. Similar 
remarks apply to the assertion (e) of this section and the assertion (n) 
of §73. 






definite type. The symbolic $\alpha$'s and $\psi$'s are, however, required
to have the thirteen properties (a) to (m) listed above, and consequently
they embody the essential more abstract characteristics of
the various specific quantum-mechanical operators and operands.

If a symbolic operator $\alpha$, a symbolic operand $\psi$ other than 0,
and a number $\lambda$ satisfy the symbolic equation

    $\alpha\psi = \lambda\psi$,    <1>

we say that $\lambda$ is an eigenvalue of $\alpha$, $\psi$ is an eigen-$\psi$ of $\alpha$, and that
the eigenvalue $\lambda$ of $\alpha$ and the eigen-$\psi$ $\psi$ of $\alpha$ belong to one another.
The noncommittal term 'eigen-$\psi$' now takes the place of the more
explicit terms 'eigenfunction' and 'eigenvector.'

72. Energy Levels of an Oscillator

Before further abstract discussion, we shall illustrate the symbolic
scheme by computing the energy levels of a linear harmonic
oscillator by a method due to Dirac. Apart from the fact that
we shall use symbolic operands, our procedure should not seem
unfamiliar, because we have already handled symbolic operators,
for example, in Exercise 7^. But the procedure will be somewhat
indirect, the main reason for this being that our computations are
symbolic equivalents of integrations; and even elementary integrations
often use indirect approaches.

The classical Hamiltonian of the oscillator is $H = p^2/2m + \tfrac{1}{2}kx^2$,
so that, if the symbolic operators associated respectively
with the coordinate $x$ and the momentum $p$ are denoted by $x$
and $p$, the symbolic Hamiltonian is

    $H = \dfrac{1}{2m}p^2 + \tfrac{1}{2}kx^2$.    <1>

According to the operator assumption I of §13, our mathematical
problem is to find the eigenvalues of the operator <1>, that is,
the values of the number $E$ for which the symbolic equation

    $H\psi = E\psi$    <2>

has symbolic solutions $\psi$ other than $\psi = 0$. We know further
that, according to the quantum condition 1^^, the symbolic
operators $x$ and $p$ satisfy the commutation rule

(5^®)    $xp - px = i\hbar$.    <3>





Let us first suppose for the moment that the oscillator has an
energy state for which $E$ is negative. For this state, as for any
other state, the average value of the energy would equal the sum
of the respective averages of $p^2/2m$ and $\tfrac{1}{2}kx^2$. Since the state in
question is an energy state, the average of the energy would be
just $E$; and since $E$ has been assumed to be negative it follows
that for this state the average of $p^2/2m$, or the average of $\tfrac{1}{2}kx^2$,
or both, would have to be negative. But all experimentally possible
values of $p^2/2m$ and $\tfrac{1}{2}kx^2$ are obviously positive. We
therefore conclude that the assumption that $E$ is negative is not
tenable, and that

    $E > 0$.    (4)

We next introduce the symbolic operators

    $X = \sqrt{k}\,x$,    $P = p/\sqrt{m}$,    <5>

and find, using <3>, that

    $XP - PX = ih\nu_c$,    <6>

where $\nu_c$ is the classical frequency of the oscillator, and that

    $H = \tfrac{1}{2}P^2 + \tfrac{1}{2}X^2$.    <7>

Now,

    $(P + iX)(P - iX) = P^2 + X^2 + i(XP - PX) = 2H - h\nu_c$    <8>

and

    $(P - iX)(P + iX) = P^2 + X^2 - i(XP - PX) = 2H + h\nu_c$,    <9>

so that, on the one hand,

    $(P - iX)(P + iX)(P - iX) = (P - iX)[(P + iX)(P - iX)] = (P - iX)(2H - h\nu_c)$,    <10>

while on the other

    $(P - iX)(P + iX)(P - iX) = [(P - iX)(P + iX)](P - iX) = (2H + h\nu_c)(P - iX)$.    <11>




Thus

    $(P - iX)(2H - h\nu_c) = (2H + h\nu_c)(P - iX)$,    <12>

or, since $h\nu_c$ is a number and therefore commutes with $P - iX$,

    $(P - iX)(H - h\nu_c) = H(P - iX)$.    <13>
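
The passage from <12> to <13> can be spelled out line by line. The
display below is a sketch of the intermediate algebra, using only
the fact that the number $h\nu_c$ commutes with $P - iX$: expand both
sides, collect the terms in $h\nu_c$ on the left, and divide by 2.

```latex
\begin{align*}
(P - iX)(2H - h\nu_c) &= (2H + h\nu_c)(P - iX) \\
2(P - iX)H - h\nu_c(P - iX) &= 2H(P - iX) + h\nu_c(P - iX) \\
2(P - iX)H - 2h\nu_c(P - iX) &= 2H(P - iX) \\
(P - iX)(H - h\nu_c) &= H(P - iX)
\end{align*}
```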

Let us now suppose that $H$ has an as yet undetermined eigenvalue
$E_n$, so that there exists a nonvanishing symbolic operand,
say $\psi_n$, satisfying the equation

    $H\psi_n = E_n\psi_n$.    <14>

Subtracting $h\nu_c\psi_n$ from both sides of <14>, we get

    $(H - h\nu_c)\psi_n = (E_n - h\nu_c)\psi_n$,    <15>

and multiplying both sides of <15> from the left by the operator
$P - iX$, we get

    $(P - iX)(H - h\nu_c)\psi_n = (P - iX)(E_n - h\nu_c)\psi_n$.    <16>

In view of <13> and the fact that $E_n - h\nu_c$ is a number, this
reduces to

    $H(P - iX)\psi_n = (E_n - h\nu_c)(P - iX)\psi_n$,    <17>

or, if we use the notation

    $(P - iX)\psi_n = \psi_{n-1}$,    <18>

to

    $H\psi_{n-1} = (E_n - h\nu_c)\psi_{n-1}$.    <19>

Equation <19> has the form <2> and means that [unless $\psi_{n-1} =
(P - iX)\psi_n = 0$, a possibility which for the moment we disregard]
$\psi_{n-1}$ is an eigen-$\psi$ of $H$, the corresponding eigenvalue
of $H$ being $E_n - h\nu_c$. We have thus shown, barring the possibility
that $\psi_{n-1} = 0$, that, if $E_n$ is an eigenvalue of $H$, then
$E_n - h\nu_c$ is also an eigenvalue of $H$. Repetition of the argument
shows that, if $E_n - h\nu_c$ is an eigenvalue of $H$, then [barring the
possibility that $\psi_{n-2} = (P - iX)\psi_{n-1} = 0$] $E_n - 2h\nu_c$ is also an
eigenvalue of $H$, and so on, so that, if $E_n$ is an eigenvalue of $H$,
then the energies

    $E_n$,  $E_n - h\nu_c$,  $E_n - 2h\nu_c$,  $E_n - 3h\nu_c$,  $\ldots$    (20)

are each an eigenvalue of $H$.





Now, unless the series (20) terminates, it will include negative 
terms that according to (4) are impossible. Consequently the 
series must terminate, and we must investigate the circumstances 
that lead to termination. If we denote the last term in (20)
by $E_0$ and the corresponding symbolic eigen-$\psi$ of $H$ by $\psi_0$, our
mathematical problem is to find the condition under which the
equation

    $H\psi_0 = E_0\psi_0$,    [$\psi_0 \neq 0$]    <21>

when treated in the same way as we treated <14>, will not lead to
the conclusion that $E_0 - h\nu_c$ is also an eigenvalue of $H$.

If we replace $\psi_n$ by $\psi_0$ in <14>, Equation <17> becomes

    $H(P - iX)\psi_0 = (E_0 - h\nu_c)(P - iX)\psi_0$    <22>

and implies that $E_0 - h\nu_c$ is an eigenvalue of $H$ unless

    $(P - iX)\psi_0 = 0$.    <23>

But if <23> holds, then

    $(P + iX)(P - iX)\psi_0 = 0$,    <24>

so that according to <8>

    $(2H - h\nu_c)\psi_0 = 0$,    <25>

that is,

    $H\psi_0 = \tfrac{1}{2}h\nu_c\psi_0$,    <26>

and, according to <21>,

    $E_0\psi_0 = \tfrac{1}{2}h\nu_c\psi_0$;    <27>

the angular brackets in the number of the last equation remind
us that the operand is symbolic. Since $\psi_0 \neq 0$, Equation <27>
is thus equivalent to the equation

    $E_0 = \tfrac{1}{2}h\nu_c$.    (28)

Thus the series (20) fails to terminate unless it contains
the term $\tfrac{1}{2}h\nu_c$; and if it contains this term it terminates
with it. Since the series must terminate, it follows that (20) has the
form

    $E_n$,  $E_n - h\nu_c$,  $E_n - 2h\nu_c$,  $\ldots$,  $\tfrac{1}{2}h\nu_c$,    (29)





that is,

    $[n + \tfrac{1}{2}]h\nu_c$,  $[(n - 1) + \tfrac{1}{2}]h\nu_c$,  $\ldots$,  $\tfrac{1}{2}h\nu_c$,    (30)

where $n$ is some positive integer. This result is our familiar

    $E_n = (n + \tfrac{1}{2})h\nu_c$,    $n = 0, 1, 2, \ldots$,    (31)

except that we have not yet shown that the series (30) should
extend without limit to the left (Exercise 1).

These computations show how the formula (31), which we
have now obtained for the fourth time, can be derived symbolically,
that is, without ascribing explicit mathematical forms
to the quantum-mechanical operators and operands. Another
example of symbolic calculations will be given in §85.
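
Although the argument above is purely symbolic, it can be watched at
work on explicit matrices. The sketch below is an illustration under
assumptions that are not part of the symbolic derivation: it uses the
familiar oscillator matrices in the energy representation, built from
a lowering matrix with elements sqrt(n + 1) just above the diagonal,
truncates them to N rows and columns, and adopts units in which
$h\nu_c = 1$. All names in the code are hypothetical.

```python
import math

N = 6  # truncation size (an assumption of this sketch)

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(N)) for j in range(N)]
            for i in range(N)]

def matvec(A, v):
    return [sum(A[i][k] * v[k] for k in range(N)) for i in range(N)]

# Lowering matrix in the energy representation: a[n][n+1] = sqrt(n+1).
a = [[math.sqrt(j) if j == i + 1 else 0.0 for j in range(N)] for i in range(N)]
a_dag = [[a[j][i] for j in range(N)] for i in range(N)]

s = 1 / math.sqrt(2)  # units in which h*nu_c = 1
X = [[s * (a[i][j] + a_dag[i][j]) for j in range(N)] for i in range(N)]
P = [[1j * s * (a_dag[i][j] - a[i][j]) for j in range(N)] for i in range(N)]

XX, PP = matmul(X, X), matmul(P, P)
H = [[0.5 * (PP[i][j] + XX[i][j]) for j in range(N)] for i in range(N)]   # cf. <7>
lower = [[P[i][j] - 1j * X[i][j] for j in range(N)] for i in range(N)]    # P - iX

def unit(n):
    """Energy state of the form (1, 0, 0, ...), (0, 1, 0, ...), and so on."""
    return [1.0 if k == n else 0.0 for k in range(N)]

psi_3 = unit(3)                  # eigenvalue 3.5 of H
psi_2 = matvec(lower, psi_3)     # (P - iX) psi_3, cf. <18>
H_psi_2 = matvec(H, psi_2)

# The last diagonal entry of H is spoiled by the truncation, so it is omitted.
levels = [H[n][n].real for n in range(N - 1)]
print(levels)
# Largest component of H psi_2 - (3.5 - 1) psi_2, cf. <19>:
print(max(abs(H_psi_2[k] - 2.5 * psi_2[k]) for k in range(N)))
# Largest component of (P - iX) psi_0, cf. <23>:
print(max(abs(z) for z in matvec(lower, unit(0))))
```

The first line printed lists the levels (31) for the untruncated part
of the matrix; the other two numbers should vanish to within
rounding error, showing that P - iX lowers the energy by one unit
and annihilates the lowest state.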

Exercises 

1. Show symbolically that, if $E$ is an eigenvalue of <7>, then $E + h\nu_c$
is certainly also an eigenvalue of <7>.

2. Correlate the symbolic result that <19> follows from <14> with the
nonsymbolic formula 55^ and find a nonsymbolic equivalent of the fact
that (28) follows symbolically from <23>.

73. The Symbolic $\phi$'s

We now turn back to the Heisenberg method and note that the
Heisenberg $\alpha$'s, which are matrices, the Heisenberg $\psi$'s, which
are column symbols, and the Heisenberg $\phi$'s, which are row
symbols conjugate to the $\psi$'s, have the following properties in
addition to those listed in §71:

(n) it has a meaning to say that two $\phi$'s are equal to one
another;

(o) any two $\phi$'s can be added together, the result being a $\phi$;

(p) there exists a $\phi$ that when added to any other $\phi$ leaves the
latter unaltered;

(q) the addition of a $\phi$ and a $\psi$ is meaningless;

(r) any $\psi$ can be multiplied from the left by any $\phi$, and the
product $\phi\psi$ is a number;

(s) the multiplication of a $\psi$ from the right by a $\phi$ is meaningless;

(t) any $\alpha$ can be multiplied from the left by any $\phi$, and the
product $\phi\alpha$ is a $\phi$;

(u) the multiplication of an $\alpha$ from the right by a $\phi$ is meaningless
unless $\alpha$ is a number;





(v) if $c$ is a number, then $c\phi = \phi c$; and

(w) the numbers $(\phi\alpha)\psi$ and $\phi(\alpha\psi)$ are equal to one another, so
that either can be denoted simply by $\phi\alpha\psi$.

Now, the symbolic method employs, besides the symbolic $\psi$'s
and the symbolic $\alpha$'s, also symbolic $\phi$'s that are postulated to
have the properties (n) to (w) listed above. These $\phi$'s are called
by Dirac conjugate imaginaries of the symbolic $\psi$'s. No specific
mathematical form is ascribed to these $\phi$'s; in particular, they
are not required to be row symbols.

One of the physical assumptions of the symbolic method is

V. If a system is in the state specified by the symbol $\psi$, then
the expected average of any dynamical variable $\alpha$ is

    $\mathrm{av}_\psi\,\alpha = \dfrac{\phi\alpha\psi}{\phi\psi}$,    <1>

where $\alpha$ in the numerator is the symbolic operator associated with
the dynamical variable $\alpha$, and $\phi$ is the conjugate imaginary of $\psi$.

Equation <1>, the symbolic expectation formula, is the symbolic
generalization of the Schroedinger expectation formula 1^^, of the
complete expectation formula of the $p$-method (of which 10®® is a
special case), and of the Heisenberg expectation formula 10^{70}
(which has exactly the same appearance as <1>, but in which
$\psi$ and $\phi$ are a column symbol and a row symbol, rather than
purely abstract symbols). If $\psi$ is such that

    $\phi\psi = 1$,    <2>

in which case the symbol $\psi$ is said to be normalized, then <1>
reduces to

    $\mathrm{av}_\psi\,\alpha = \phi\alpha\psi$.    <3>

To give an example of a symbolic computation involving $\phi$'s, we shall
now show that, if $\psi$ is an eigen-$\psi$ of $\alpha$ belonging to the eigenvalue
$a$ of $\alpha$, that is, if

    $\alpha\psi = a\psi,$    (4)

then the value of the dynamical variable $\alpha$ for the state $\psi$ is certainly $a$.
Indeed, multiplying (4) from the left by the conjugate imaginary of $\psi$,
we get $\phi\alpha\psi = \phi a\psi$, that is, $\phi\alpha\psi = a\phi\psi$ and
$\phi\alpha\psi/\phi\psi = a$, so that, according to (1),

    $\mathrm{av}_\psi\,\alpha = a;$    (5)

multiplying (4) from the left by $\alpha$, we get
$\alpha^2\psi = \alpha a\psi = a\alpha\psi$, so that

    $\alpha^2\psi = a^2\psi,$    (6)

from which it follows that $\phi\alpha^2\psi/\phi\psi = a^2$, that is,

    $\mathrm{av}_\psi\,\alpha^2 = a^2;$    (7)

thus $\mathrm{av}_\psi\,\alpha^2 = (\mathrm{av}_\psi\,\alpha)^2$, and the fact that the
value of $\alpha$ for the state $\psi$ is certainly $a$ follows from (5) and 20^^.
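The symbolic steps (4) to (7) can be imitated with Heisenberg-type symbols. The following is only a minimal sketch: the 2-by-2 matrix, its eigenvalue, and all function names are hypothetical choices made for illustration and do not come from the text.

```python
# A column symbol psi is a list of numbers, a row symbol phi is its
# complex conjugate, and alpha is a (hypothetical) Hermitian matrix.

def row_times_col(phi, psi):
    """Product phi*psi of a row symbol and a column symbol: a number."""
    return sum(p * q for p, q in zip(phi, psi))

def mat_times_col(alpha, psi):
    """Product alpha*psi of a matrix and a column symbol: a column symbol."""
    return [sum(a * q for a, q in zip(row, psi)) for row in alpha]

def conjugate_row(psi):
    """Row symbol conjugate to the column symbol psi."""
    return [complex(q).conjugate() for q in psi]

def expected_average(alpha, psi):
    """Formula (1): (phi alpha psi) / (phi psi)."""
    phi = conjugate_row(psi)
    return row_times_col(phi, mat_times_col(alpha, psi)) / row_times_col(phi, psi)

# Illustrative operator with eigenvalue a = 3 and eigen-psi (i, 1).
alpha = [[2, 1j], [-1j, 2]]
psi = [1j, 1]
a = 3

alpha_psi = mat_times_col(alpha, psi)            # step (4): equals a * psi
alpha_sq_psi = mat_times_col(alpha, alpha_psi)   # step (6): equals a^2 * psi
phi = conjugate_row(psi)
av_alpha = expected_average(alpha, psi)                                    # (5)
av_alpha_sq = row_times_col(phi, alpha_sq_psi) / row_times_col(phi, psi)   # (7)
```

In this sketch the average of the square equals the square of the average, which is the numerical counterpart of the statement that the value of the variable is certain.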

The Schroedinger equivalent of the multiplication of a symbolic
equation by $\phi$ (which multiplication must be done from the left)
is the multiplication of the corresponding equation of the
Schroedinger scheme from the left by $\bar\psi$, followed by integration, rather than
merely multiplication by $\bar\psi$; for example, in the one-dimensional
case, the Schroedinger equivalent of the symbolic step from the
equation $\alpha\psi = a\psi$ to the equation $\phi\alpha\psi = \phi a\psi$ is the step from the
equation $\alpha\psi = a\psi$ to the equation $\int\bar\psi\alpha\psi\,dx = \int\bar\psi a\psi\,dx$. Hence
the Schroedinger equivalents of the symbolic $\phi$'s, which we shall
not discuss, are not recognized as readily as are their Heisenberg
equivalents.

We may, if we wish, picture the symbolic $\psi$'s as vectors in a
certain space, the $\psi$-space, and picture the symbolic $\alpha$'s as
mappings of this space; the symbolic $\phi$'s are then to be pictured as
vectors in another space, the $\phi$-space. That such a quasi-geometric
interpretation will not lead to contradictions follows from
the complete parallelism between the rules for handling the
symbolic $\psi$'s, $\alpha$'s, and $\phi$'s, and those for handling the Heisenberg
$\psi$'s, $\alpha$'s, and $\phi$'s.

With these few remarks, we end our discussion of the symbolic
method. To build up systematically the rules for manipulating
the symbolic $\psi$'s, $\alpha$'s, and $\phi$'s, and to introduce such notions as
the orthogonality of two $\psi$'s, the complex conjugate of an $\alpha$,
and so on, would merely mean rewording appropriately our
familiar vector-matrix rules and definitions; but to go further
than that, to show, for example, how Schroedinger functions can
be expressed in terms of the symbolic $\psi$'s, would require rather
advanced discussion.

Exercises 

1. Correlate the symbolic steps that we used in deducing the physical 
implications of (4) with the explicit steps used in §17 in deducing the 
physical implications of 7^’. 





2. If the symbolic operator a satisfies the equation for 

every pair of symbolic operands ypr and , then a is said to be Hermitian, 
Correlate this definition with 19*^, and show symbolically that every 
eigenvalue of a Hermitian operator is real. 

3. Show that in view of the assumption V the second sentence in the 
operator assumption I of §13 can be deleted. 



Chapter XII 


PROBLEMS IN THREE-DIMENSIONAL MOTION; THE
SCHROEDINGER METHOD

We shall now consider a few problems concerning phenomena
taking place in our physical three-dimensional space. The
Schroedinger method for handling these problems is a direct
extension of that of Chapters III to VII, referring to fictitious
one-dimensional motion; several of the more general results of
these chapters will, in fact, reappear almost word for word in the
present chapter. Some of the numbers at the left of our new
equations are those of the corresponding one-dimensional equations.

74. Mathematical Preliminaries 

We use the Cartesian frame of Fig. 100 and denote the point
whose coordinates are $x$, $y$, and $z$ by the symbol $xyz$; wherever the
symbol $xyz$ is used for the product of the Cartesian coordinates of
a point rather than for the point itself, this fact will be stated
explicitly. The vector with origin at $O$ and terminus at $xyz$ is called
the position vector and is denoted by $\mathbf{r}$; its length is denoted
by $r$. We denote functions of $x$, $y$, and $z$ by $u(x, y, z)$, $v(x, y, z)$,
$f(x, y, z)$, and so on, or simply by $u$, $v$, $f$, and so on; their complex
conjugates by $\bar u(x, y, z)$, $\bar v(x, y, z)$, $\bar f(x, y, z)$, and so on, or by
$\bar u$, $\bar v$, $\bar f$, and so forth; and their moduli by $|u(x, y, z)|$,
$|v(x, y, z)|$, $|f(x, y, z)|$, and so forth, or by $|u|$, $|v|$, $|f|$,
and so forth.

If $u$ is everywhere a continuous function of $x$, of $y$, and of $z$,
and if each of the three partial derivatives $\partial u/\partial x$, $\partial u/\partial y$, and
$\partial u/\partial z$ is everywhere a continuous function of $x$, of $y$, and of $z$,
except perhaps at isolated points, then we say that $u$ satisfies our
standard continuity conditions. If $u$ has a single value at each
point in space, we call it single-valued. If $u$ satisfies our standard
continuity conditions, is single-valued, and is not identically
zero, and if $|u|$ is everywhere finite,¹ then we say that $u$ is
well-behaved.

We also use the spherical polar frame of Fig. 101 and denote
the point whose coordinates are $r$, $\theta$, and $\phi$ by the symbol $r\theta\phi$.
The relations between the Cartesian and the polar coordinates
of a given point are

    $x = r\sin\theta\cos\phi, \quad y = r\sin\theta\sin\phi, \quad z = r\cos\theta;$
    $r = \sqrt{x^2+y^2+z^2}, \quad \theta = \cos^{-1}\dfrac{z}{\sqrt{x^2+y^2+z^2}}, \quad \phi = \tan^{-1}\dfrac{y}{x}.$    (1)

Note that, while in the Cartesian frame every point has an un- 
ambiguous single set of three coordinates, this is not so in the 

polar frame; for example, the set 
r 1, 0 = Jtt, and <^ = speci- 
fies the same point in space as does 
the set r = 1, ^ = Itt, and^ == i'»r; 
this property of the polar frame is in 
part responsible for the stress that 
we now put on single-valuedness. 
If $x$, $y$, and $z$ are eliminated from
the function $u = u(x, y, z)$ by means
of $r$, $\theta$, and $\phi$, then we denote the
resulting function of $r$, $\theta$, and $\phi$ by $u(r, \theta, \phi)$ or simply by $u$
[strictly, the $u$ in $u(r, \theta, \phi)$ should then be replaced by another
symbol]; for example, if u = u{x, y, z) = then we 

wnte u = u(x, Bj <#>) = although in this particular case wc 
may write u = u{r) = 



¹ By "$|u|$ is everywhere finite" we mean not only that $|u|$ is finite (that is,
not infinite) at every particular point, but also that $|u|$ does not increase
without limit when $r \to \infty$, that is, when the point $xyz$ recedes (in any
manner whatever) from the origin 0; thus the function is not well- 

behaved because, for example, it grows without limit if the point xyz re- 
cedes from 0 along the r-axis. Incidentally, the complete conditions for 
good behavior of u, which we need not consider in our elementary work, 
allow u to become infinite at isolated points, provided its infinities are so 
mild that certain integrals involving u are finite; see Kemble, §23d. 





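In accordance with the convention stated above Equation 29®
we use the abbreviations

    $\int f\,dx = \int_{-\infty}^{\infty} f\,dx, \quad \int f\,dy = \int_{-\infty}^{\infty} f\,dy, \quad \int f\,dz = \int_{-\infty}^{\infty} f\,dz,$    (2)

and

    $\int f\,dr = \int_0^{\infty} f\,dr, \quad \int f\,d\theta = \int_0^{\pi} f\,d\theta, \quad \int f\,d\phi = \int_0^{2\pi} f\,d\phi.$    (3)

Further, we denote the differential volume element by $d\tau$ and
use the symbol

    $\int f\,d\tau$    (4)

for the definite integral of $f$ taken over the entire space. For
example, if Cartesian coordinates are used, so that $d\tau = dx\,dy\,dz$,
we let

    $\int f\,d\tau = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} f(x, y, z)\,dx\,dy\,dz;$    (5)

again, if spherical polar coordinates are used, so that
$d\tau = r^2\,dr\,\sin\theta\,d\theta\,d\phi$, we let

    $\int f\,d\tau = \int_0^{\infty}\int_0^{\pi}\int_0^{2\pi} f(r, \theta, \phi)\,r^2\sin\theta\,dr\,d\theta\,d\phi,$    (6)

the integration with respect to $\phi$ extending from 0 to $2\pi$, that with
respect to $\theta$ extending from 0 to $\pi$, and that with respect to $r$
extending from 0 to $\infty$.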
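As a rough numerical illustration of (5) and (6), the sketch below integrates the function $e^{-r^2}$ over all space in both frames. The test function, the cutoff standing in for infinity, and the helper name `trapezoid` are assumptions made here for illustration only.

```python
import math

def trapezoid(g, a, b, n=4000):
    """Composite trapezoidal rule for the integral of g from a to b."""
    h = (b - a) / n
    total = 0.5 * (g(a) + g(b))
    for i in range(1, n):
        total += g(a + i * h)
    return total * h

CUTOFF = 10.0  # stands in for infinity; exp(-100) is negligible

# Cartesian form (5): exp(-(x^2 + y^2 + z^2)) factors into three equal
# one-dimensional integrals.
one_dim = trapezoid(lambda x: math.exp(-x * x), -CUTOFF, CUTOFF)
cartesian = one_dim ** 3

# Polar form (6): the same function is exp(-r^2), weighted by r^2 sin(theta).
radial = trapezoid(lambda r: r * r * math.exp(-r * r), 0.0, CUTOFF)
polar_angle = trapezoid(math.sin, 0.0, math.pi)
azimuth = 2.0 * math.pi            # integral of d(phi) from 0 to 2*pi
polar = radial * polar_angle * azimuth

exact = math.pi ** 1.5             # known closed form for this test function
```

Both `cartesian` and `polar` should agree with `exact` to within the accuracy of the trapezoidal rule, which is the content of the statement that (5) and (6) are two forms of the same integral (4).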

If the integral $\int\bar u u\,d\tau$ is finite, then $u$ is said to be of integrable
square or quadratically integrable (in the infinite region). If

(IQ*)    $\int\bar u u\,d\tau = 1,$    (7)

then $u$ is said to be normal, unitary, normalized to unity, or simply
normalized (in the infinite region). If $f$ is quadratically integrable
(in the infinite region) and $N$ is a number such that

(22*)    $\bar N N = \left(\int\bar f f\,d\tau\right)^{-1},$    (8)

then the function $Nf$ is normalized (in the infinite region); the
number $N$ (the normalizing factor of $f$) is indeterminate to the
extent of a phase factor $e^{i\gamma}$, where $\gamma$ is an arbitrary real constant.





If the functions $u$ and $v$ satisfy the condition

(3®)    $\int\bar u v\,d\tau = 0$    (9)

or the equivalent condition $\int\bar v u\,d\tau = 0$, then they are said to be
mutually orthogonal (in the infinite region). If every two
functions in the set

    $u_1, u_2, u_3, \ldots$    (10)

are mutually orthogonal in the infinite region, then the set is
said to be orthogonal (in this region); if, in addition, each $u_k$ is
normalized, then the set is said to be orthonormal (in the infinite
region).

If a function $f$ admits of an expansion in terms of the orthogonal
set (10), that is, if $f$ can be expressed throughout the infinite
region in the form

(8‘)    $f = \sum_k c_k u_k, \quad k = 1, 2, 3, \ldots,$    (11)

then the expansion coefficients are given by the formula

(12*)    $c_k = \dfrac{\int\bar u_k f\,d\tau}{\int\bar u_k u_k\,d\tau},$    (12)

which reduces to

(21*)    $c_k = \int\bar u_k f\,d\tau$    (13)

whenever the set (10) is orthonormal. The proof of (12) runs
quite parallel to that of 12*.
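Formula (12) can be tried out numerically. The sketch below is a one-dimensional stand-in for the three-dimensional integrals of the text: the orthogonal set (three unnormalized Hermite functions), the function $f$, and the integration limits are all hypothetical choices made for illustration.

```python
import math

def trapezoid(g, a, b, n=4000):
    """Composite trapezoidal rule for the integral of g from a to b."""
    h = (b - a) / n
    total = 0.5 * (g(a) + g(b))
    for i in range(1, n):
        total += g(a + i * h)
    return total * h

# Hypothetical orthogonal, unnormalized set: u_k = H_k(x) exp(-x^2 / 2).
def u0(x):
    return math.exp(-x * x / 2)

def u1(x):
    return 2 * x * math.exp(-x * x / 2)

def u2(x):
    return (4 * x * x - 2) * math.exp(-x * x / 2)

basis = [u0, u1, u2]

def f(x):
    """Function with known expansion coefficients 2, 0, -3."""
    return 2.0 * u0(x) - 3.0 * u2(x)

def coefficient(u, func, a=-12.0, b=12.0):
    """Formula (12); the functions here are real, so no conjugation is needed."""
    numerator = trapezoid(lambda x: u(x) * func(x), a, b)
    denominator = trapezoid(lambda x: u(x) * u(x), a, b)
    return numerator / denominator

coeffs = [coefficient(u, f) for u in basis]
```

Because the set is not normalized, the denominator of (12) is essential here; dropping it, as in (13), would be justified only for an orthonormal set.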

Differential operators. The operators with which we shall now
deal and which for lack of a better short name we call differential
operators are combinations of the operators $x$, $y$, and $z$ (which
stand, respectively, for the multiplication of operands by the
variable $x$, the variable $y$, and the variable $z$), the operators
$\partial/\partial x$, $\partial/\partial y$, and $\partial/\partial z$ (which stand, respectively, for partial
differentiation of operands with respect to $x$, with respect to $y$, and
with respect to $z$), and the operators of type $c$ (which stand for the
multiplication of operands by numerical constants). Equality,
addition, multiplication, and so forth, of these operators are
defined in the same way as before, as are such terms as commutator,
and so forth. We leave it to the reader to show, in particular, that
any two of the three operators $x$, $y$, and $z$ commute with each
other, that any two of the three operators $\partial/\partial x$, $\partial/\partial y$, and
$\partial/\partial z$ commute with each other, and that


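    $\dfrac{\partial}{\partial x}x - x\dfrac{\partial}{\partial x} = 1, \quad \dfrac{\partial}{\partial y}y - y\dfrac{\partial}{\partial y} = 1, \quad \dfrac{\partial}{\partial z}z - z\dfrac{\partial}{\partial z} = 1,$    (14)

    $\dfrac{\partial}{\partial x}y - y\dfrac{\partial}{\partial x} = 0, \quad \dfrac{\partial}{\partial y}z - z\dfrac{\partial}{\partial y} = 0, \quad \dfrac{\partial}{\partial z}x - x\dfrac{\partial}{\partial z} = 0,$    (15)

    $\dfrac{\partial}{\partial x}z - z\dfrac{\partial}{\partial x} = 0, \quad \dfrac{\partial}{\partial y}x - x\dfrac{\partial}{\partial y} = 0, \quad \dfrac{\partial}{\partial z}y - y\dfrac{\partial}{\partial z} = 0.$    (16)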
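The commutation rules (14) to (16) can be checked mechanically on polynomial operands. In the sketch below a polynomial in $x$, $y$, $z$ is stored as a dictionary from exponent triples to coefficients; this representation, the operand `u`, and the function names are illustrative assumptions.

```python
# Polynomials in x, y, z are dicts mapping exponent triples (i, j, k)
# to coefficients; variables are indexed 0 (x), 1 (y), 2 (z).

def multiply_by(var, poly):
    """The operator x, y, or z: multiply the operand by that variable."""
    out = {}
    for exps, c in poly.items():
        e = list(exps)
        e[var] += 1
        out[tuple(e)] = out.get(tuple(e), 0) + c
    return out

def differentiate(var, poly):
    """The operator d/dx, d/dy, or d/dz applied to the operand."""
    out = {}
    for exps, c in poly.items():
        if exps[var] > 0:
            e = list(exps)
            e[var] -= 1
            out[tuple(e)] = out.get(tuple(e), 0) + c * exps[var]
    return out

def subtract(p, q):
    """Difference p - q with vanishing terms removed."""
    out = dict(p)
    for exps, c in q.items():
        out[exps] = out.get(exps, 0) - c
    return {e: c for e, c in out.items() if c != 0}

def commutator(dvar, mvar, u):
    """(d/d dvar)(mvar * u) - mvar * (d/d dvar) u."""
    return subtract(differentiate(dvar, multiply_by(mvar, u)),
                    multiply_by(mvar, differentiate(dvar, u)))

# Arbitrary operand u = 3 x^2 y z + 5 y^3 - 2 z.
u = {(2, 1, 1): 3, (0, 3, 0): 5, (0, 0, 1): -2}

same_variable = [commutator(v, v, u) for v in range(3)]   # rules (14)
mixed = [commutator(d, m, u)                              # rules (15), (16)
         for d in range(3) for m in range(3) if d != m]
```

Each entry of `same_variable` reproduces the operand `u` itself (the commutator acts as the number 1), while each entry of `mixed` is the empty polynomial.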


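The differential operator $\partial^2/\partial x^2 + \partial^2/\partial y^2 + \partial^2/\partial z^2$ is called the
Laplace operator, or the Laplacian; we abbreviate it by the symbol
$\nabla^2$, usually read del squared:

    $\nabla^2 = \dfrac{\partial^2}{\partial x^2} + \dfrac{\partial^2}{\partial y^2} + \dfrac{\partial^2}{\partial z^2}.$    (17)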


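Other differential operators of interest to us are combinations of
the operators $r$, $\partial/\partial r$, $\partial/\partial\theta$, $\partial/\partial\phi$, and $c$; these (polar) operators
can be expressed in terms of the (Cartesian) operators introduced
above, and vice versa. To illustrate, we consider the operator
$\partial/\partial z$. According to the rules for partial differentiation,

    $\dfrac{\partial u}{\partial z} = \dfrac{\partial r}{\partial z}\dfrac{\partial u}{\partial r} + \dfrac{\partial\theta}{\partial z}\dfrac{\partial u}{\partial\theta} + \dfrac{\partial\phi}{\partial z}\dfrac{\partial u}{\partial\phi};$    (18)

in view of (1),

    $\dfrac{\partial r}{\partial z} = \dfrac{\partial}{\partial z}\sqrt{x^2+y^2+z^2} = \dfrac{z}{\sqrt{x^2+y^2+z^2}} = \dfrac{z}{r} = \cos\theta,$    (19)

    $\dfrac{\partial\theta}{\partial z} = \dfrac{\partial}{\partial z}\cos^{-1}\dfrac{z}{r} = -\dfrac{1}{\sqrt{1 - z^2/r^2}}\,\dfrac{\partial}{\partial z}\dfrac{z}{r} = -\dfrac{1}{\sqrt{1 - z^2/r^2}}\,\dfrac{1}{r}\sin^2\theta = -\dfrac{1}{r}\sin\theta,$    (20)

and

    $\dfrac{\partial\phi}{\partial z} = 0,$    (21)

so that (18) becomes

    $\dfrac{\partial u}{\partial z} = \cos\theta\,\dfrac{\partial u}{\partial r} - \dfrac{1}{r}\sin\theta\,\dfrac{\partial u}{\partial\theta};$    (22)

and since (22) holds for an arbitrary operand $u$ on which the
indicated operations can be performed, we conclude that

    $\dfrac{\partial}{\partial z} = \cos\theta\,\dfrac{\partial}{\partial r} - \dfrac{1}{r}\sin\theta\,\dfrac{\partial}{\partial\theta}.$    (23)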
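The operator identity (23) can be tested on a particular operand by finite differences. In the sketch below the operand `u_cart`, the sample point, and the step size are arbitrary illustrative choices.

```python
import math

def u_cart(x, y, z):
    """Hypothetical test operand in Cartesian form."""
    return x * z + y * y + z ** 3

def u_polar(r, th, ph):
    """The same operand with x, y, z eliminated by means of (1)."""
    x = r * math.sin(th) * math.cos(ph)
    y = r * math.sin(th) * math.sin(ph)
    z = r * math.cos(th)
    return u_cart(x, y, z)

def central(g, t, h=1e-5):
    """Central-difference estimate of dg/dt."""
    return (g(t + h) - g(t - h)) / (2 * h)

# An arbitrary point, given in polar and in Cartesian coordinates.
r, th, ph = 1.3, 0.7, 2.1
x = r * math.sin(th) * math.cos(ph)
y = r * math.sin(th) * math.sin(ph)
z = r * math.cos(th)

# Left side of (23): d/dz at fixed x and y.
lhs = central(lambda zz: u_cart(x, y, zz), z)

# Right side of (23): cos(theta) d/dr - (1/r) sin(theta) d/dtheta.
du_dr = central(lambda rr: u_polar(rr, th, ph), r)
du_dth = central(lambda tt: u_polar(r, tt, ph), th)
rhs = math.cos(th) * du_dr - math.sin(th) / r * du_dth
```

The two sides agree to within the truncation error of the difference quotients; a check of this kind cannot replace the derivation, but it catches sign errors quickly.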


Eigenvalues and eigenfunctions. If a differential operator $\alpha$,
a well-behaved operand $u$, and a numerical constant $\lambda$ satisfy the
equation

(15*)    $\alpha u = \lambda u,$    (24)

then we say, as before, that $\lambda$ is an eigenvalue of $\alpha$, that $u$ is an
eigenfunction of $\alpha$, and that the eigenvalue $\lambda$ and the eigenfunction
$u$ of the operator $\alpha$ belong to each other. The computation of
the eigenvalues of operators in the three-dimensional case is of
course usually more involved than in the one-dimensional case,
but is much the same in principle except for the new emphasis
on single-valuedness.

To illustrate the effect of requiring that $u$ in (24) be
single-valued, we shall compute the eigenvalues of the operator
$-i\,\partial/\partial\phi$, for which (24) becomes the differential equation

    $-i\,\dfrac{\partial u}{\partial\phi} = \lambda u.$    (25)

The general solution of (25) is

    $u = A(r, \theta)\,e^{i\lambda\phi},$    (26)

where $A(r, \theta)$ is an arbitrary function of $r$ and $\theta$, and it remains
to determine the circumstances under which (26) is well-behaved.
Now, the behavior of (26) depends of course on that of both
$A(r, \theta)$ and $e^{i\lambda\phi}$; of $A(r, \theta)$ we can say only that it should not
spoil the behavior of $u$ [the choice $A(r, \theta)$ = constant, for example,
is satisfactory]; but the second factor, whose explicit form is
available, enables us to derive a limitation to be put on $\lambda$. Indeed,
since the two sets of coordinates $(r, \theta, \phi)$ and $(r, \theta, \phi + 2\pi)$ specify
the same point in space, the value of the function $u$, if $u$ is to be
single-valued, must remain unaltered when we replace $\phi$ by $\phi + 2\pi$;
in other words, the condition of single-valuedness requires that
(26) satisfy the equation

    $A(r, \theta)\,e^{i\lambda\phi} = A(r, \theta)\,e^{i\lambda(\phi + 2\pi)},$    (27)

that is,

    $1 = e^{2\pi i\lambda}.$    (28)

Equation (28) holds if and only if

    $\lambda = \ldots, -2, -1, 0, 1, 2, \ldots,$    (29)

so that the eigenvalues of $-i\,\partial/\partial\phi$ are the positive and the negative
integers, zero included. Note that, because of the cyclic character
of the coordinate $\phi$, the eigenvalue spectrum of $-i\,\partial/\partial\phi$ is discrete,
and is quite different from that of, say, $-i\,\partial/\partial x$. In dealing with
the operator $-i\,\partial/\partial\phi$, the symbol $\lambda$ is usually replaced by the
symbol $m$, and the eigenfunctions of $-i\,\partial/\partial\phi$ are then written as

    $u_m = A_m(r, \theta)\,e^{im\phi}, \quad m = 0, \pm 1, \pm 2, \ldots,$    (30)

where the subscript $m$ of $A$ reminds us that the arbitrary
well-behaved factor $A(r, \theta)$ may be different for different values of $m$.
The $\phi$-factors in (30) have the form 28®, and hence the quantity
$\int\bar u_m u_{m'}\,d\phi$ vanishes whenever $m \neq m'$; for this reason the set
(30) is said to be 'orthogonal in $\phi$.'
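The role of single-valuedness in (27) to (29), and the orthogonality in $\phi$ of the set (30), can be illustrated numerically. The sketch below takes $A = 1$; the function names, the sample angle, and the rectangle-rule integration are illustrative assumptions.

```python
import cmath
import math

def mismatch(lam, phi=0.4):
    """Modulus of the difference between the two sides of (27), with A = 1."""
    return abs(cmath.exp(1j * lam * (phi + 2 * math.pi)) - cmath.exp(1j * lam * phi))

def overlap(m1, m2, n=2000):
    """Integral over phi from 0 to 2*pi of conj(e^{i m1 phi}) * e^{i m2 phi},
    estimated by the rectangle rule on n equally spaced points."""
    h = 2 * math.pi / n
    return sum(cmath.exp(1j * (m2 - m1) * k * h) for k in range(n)) * h

# Integer values of lambda satisfy (27); a non-integer value does not.
integer_mismatches = [mismatch(m) for m in (-2, -1, 0, 1, 2)]
half_integer_mismatch = mismatch(0.5)

different_m = overlap(1, 3)   # vanishes: the phi-factors are orthogonal in phi
same_m = overlap(2, 2)        # equals 2*pi
```

For $\lambda = \tfrac12$ the function changes sign after one full turn, so the mismatch is as large as it can be for a factor of unit modulus.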

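The Legendrian.² An operator of special interest to us is

    $-\dfrac{1}{\sin\theta}\,\dfrac{\partial}{\partial\theta}\,\sin\theta\,\dfrac{\partial}{\partial\theta} - \dfrac{1}{\sin^2\theta}\,\dfrac{\partial^2}{\partial\phi^2},$    (31)

which we call the Legendre operator, or simply the Legendrian; its
Cartesian form, with a reversed algebraic sign, is shown on the
left side of (74). To compute the eigenvalues of the Legendrian,
we look for the values of $\lambda$ for which the equation

    $\left(-\dfrac{1}{\sin\theta}\,\dfrac{\partial}{\partial\theta}\,\sin\theta\,\dfrac{\partial}{\partial\theta} - \dfrac{1}{\sin^2\theta}\,\dfrac{\partial^2}{\partial\phi^2}\right)u = \lambda u$    (32)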


² The Legendre operator plays a fundamental rôle in the quantum
theory of angular momentum; our discussion of it is, however, not
extensive, and one of the important tasks that the reader should eventually
take up is a more elaborate study of the eigenfunctions of this operator,
called surface spherical harmonics and discussed in mathematical works
on spherical harmonics, in classical treatments of potential theory, and
in some books on quantum mechanics. A clear elementary presentation
(but written from a standpoint somewhat different from that adopted
here) is given in Chapter VIII of Jeans; our presentation follows in part
that of §2A of Sommerfeld.





has well-behaved solutions. The Legendrian does not involve $r$
or $\partial/\partial r$, and consequently we may temporarily take $u$ to be a
function of $\theta$ and $\phi$ only, and later permit the arbitrary constants
in the solutions of (32) to be functions of $r$.

Separation of variables. The partial differential equation (32),
called the Legendre equation, can be separated into two ordinary
differential equations in much the same way that the partial
differential equation 7*^ was separated into the two ordinary
differential equations 11^ and 12^. Indeed, turning to $u$'s of the
special form $u = \Theta(\theta)\Phi(\phi)$, or, for short,

    $u = \Theta\Phi$    (33)

(where $\Theta$ is a function of $\theta$ only, and $\Phi$ is a function of $\phi$ only),
substituting (33) into (32), and dividing the resulting equation by
$-\Theta\Phi/\sin^2\theta$, we get

    $\dfrac{1}{\Theta}\left(\sin\theta\,\dfrac{\partial}{\partial\theta}\,\sin\theta\,\dfrac{\partial}{\partial\theta}\right)\Theta + \lambda\sin^2\theta = -\dfrac{1}{\Phi}\,\dfrac{\partial^2\Phi}{\partial\phi^2}.$    (34)

Note that in (34) the parentheses are not strictly necessary, and
$\partial$'s can be replaced by $d$'s. Now, the left side of (34) depends
only on $\theta$, while its right side depends only on $\phi$, so that (34)
has the form $f(\theta) = g(\phi)$; and, since $\theta$ and $\phi$ are independent
variables, it follows that the left side of (34) cannot in fact depend
even on $\theta$, while its right side cannot in fact depend even on $\phi$.
In other words, each side of (34) is a numerical constant, say $c$:

    $\dfrac{1}{\Theta}\,\sin\theta\,\dfrac{\partial}{\partial\theta}\,\sin\theta\,\dfrac{\partial\Theta}{\partial\theta} + \lambda\sin^2\theta = c$    (35)

and

    $-\dfrac{1}{\Phi}\,\dfrac{\partial^2\Phi}{\partial\phi^2} = c.$    (36)

We shall call (35) and (36) the '$\theta$-equation' and the '$\phi$-equation,'
respectively.

The $\phi$-equation. The operator $-\partial^2/\partial\phi^2$ is the square of the
operator $-i\,\partial/\partial\phi$, so that its eigenvalues are the squares of the
eigenvalues of $-i\,\partial/\partial\phi$, that is, the squares of the integers (29).
Therefore Equation (36), which is just the eigenvalue equation
for the operator $-\partial^2/\partial\phi^2$, has well-behaved solutions if and only if

    $c = m^2, \quad m = 0, \pm 1, \pm 2, \ldots.$    (37)

Replacing $c$ in (36) by $m^2$ we find that

    $\Phi = Ae^{im\phi} + Be^{-im\phi},$    (38)

where $A$ and $B$ are arbitrary constants (constants in the sense
that they are independent of both $\phi$ and $\theta$).

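The $\theta$-equation. Replacing $c$ in (35) by $m^2$ and changing from
$\partial$'s to $d$'s, we get

    $\left(\dfrac{1}{\sin\theta}\,\dfrac{d}{d\theta}\,\sin\theta\,\dfrac{d}{d\theta} + \lambda - \dfrac{m^2}{\sin^2\theta}\right)\Theta = 0.$    (39)

When we introduce the new independent variable

    $\mu = \cos\theta,$    (40)

whose range is

    $-1 \le \mu \le 1,$    (41)

and note that $d/d\theta = -\sin\theta\,d/d\mu$, Equation (39) becomes

    $\left(\dfrac{d}{d\mu}\,(1 - \mu^2)\,\dfrac{d}{d\mu} + \lambda - \dfrac{m^2}{1 - \mu^2}\right)\Theta = 0.$    (42)

Our purpose is now to find the values of $\lambda$ for which (42) has
well-behaved solutions.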

In treating (42) it is convenient to write

    $\Theta = (1 - \mu^2)^{|m|/2}\,v(\mu),$    (43)

where $v$ is an as yet undetermined function of $\mu$; substituting (43)
into (42), we then find that

    $(1 - \mu^2)v'' - 2(|m| + 1)\mu v' + (\lambda - |m| - m^2)v = 0,$    (44)

where $v' = dv/d\mu$, and so on.

We now proceed to solve (44) by power series; the method is
similar to that used in the case of 17^, and we shall outline it
but briefly. We let

    $v = \sum_k a_k\mu^k, \quad k = \tau, \tau + 1, \tau + 2, \ldots,$    (45)

where $\tau$ and the $a$'s are as yet undetermined constants and $a_\tau \neq 0$,
and, substituting (45) into (44), get

    $\sum_k a_k\{k(k - 1)\mu^{k-2} + [\lambda - (k + |m|)(k + |m| + 1)]\mu^k\} = 0.$    (46)

The coefficient of $\mu^k$ in (46) is

    $(k + 2)(k + 1)a_{k+2} + [\lambda - (k + |m|)(k + |m| + 1)]a_k.$    (47)

In order that (46) may hold, the coefficient (47) must vanish for
every value of $k$, so that

    $\dfrac{a_{k+2}}{a_k} = \dfrac{(k + |m|)(k + |m| + 1) - \lambda}{(k + 1)(k + 2)}.$    (48)

To determine $\tau$, we note that the coefficient of the lowest power
of $\mu$, that is, of $\mu^{\tau-2}$, in (46) is $a_\tau\tau(\tau - 1)$, so that the equation
$\tau(\tau - 1) = 0$ must hold, and we have the two possibilities $\tau = 0$
and $\tau = 1$. The series, which we denote by $V_0$, obtained from
(45) by letting $\tau = 0$, letting $a_0 = 1$, and omitting the odd powers
of $\mu$, that is,

    $V_0(\mu) = 1 + a_2\mu^2 + a_4\mu^4 + \cdots,$    (49)

is a particular solution of (44), provided of course that the $a$'s
satisfy (48); the case $\tau = 1$ yields another particular solution,

    $V_1(\mu) = \mu + a_3\mu^3 + a_5\mu^5 + \cdots,$    (50)

and the general solution of (44) is

    $v = AV_0(\mu) + BV_1(\mu).$    (51)

If the series (49) terminates, that is, if $V_0$ is a polynomial, then
$V_0$ is finite throughout the region $-1 \le \mu \le 1$ and, when
substituted into (43), yields a well-behaved $\Theta$. On the other hand,
if (49) is an infinite series, then $V_0$, when substituted into (43),
yields an ill-behaved $\Theta$. Indeed, for $\mu = 1$ we have

    $V_0(1) = 1 + a_2 + a_4 + \cdots;$    (52)

this series diverges, and a more complete study shows that the
factor $(1 - \mu^2)^{|m|/2}$ in (43) fails to keep $\Theta$ finite when $v$ is infinite. The
series $V_1$ behaves in a similar way. Thus the permissible values
of $\lambda$ are those which lead to a terminating $V$, that is, the values

    $\lambda = (k + |m|)(k + |m| + 1),$    (53)

which, according to (48), cause the vanishing of $a_{k+2}$, $a_{k+4}$, $a_{k+6}$,
and so on.

Therefore, the only way to obtain a well-behaved solution of
the Equation (42) is

(a) to let $\lambda = (k + |m|)(k + |m| + 1)$, where $k$ is a positive
integer or zero (thus causing $V_0$ to terminate if $k$ is even, or $V_1$
if $k$ is odd), and

(b) to set the arbitrary multiplier of the nonterminating series
in (51) equal to zero.

Since $|m|$ in (53) takes on the values 0, 1, 2, ..., and since $k$
(the exponent of the highest power of $\mu$ in the polynomial $v$)
also takes on these values, we may set

    $k + |m| = l, \quad l = 0, 1, 2, \ldots$    (54)

and rewrite (53) as

    $\lambda = l(l + 1), \quad l = 0, 1, 2, \ldots;$    (55)

the eigenvalues of the Legendrian are thus the special integers 0, 2,
6, 12, 20, ....
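The way the recurrence (48) singles out the values (55) can be seen by generating a few coefficients exactly. The following sketch uses rational arithmetic; the function name and the two sample values of $\lambda$ are illustrative choices, not part of the text.

```python
from fractions import Fraction

def series_coefficients(lam, m, start, count):
    """Coefficients a_start, a_{start+2}, ... generated by the recurrence (48),
    beginning with a_start = 1 (start = 0 gives V_0, start = 1 gives V_1)."""
    k = start
    a = Fraction(1)
    coeffs = [a]
    for _ in range(count - 1):
        a = a * Fraction((k + abs(m)) * (k + abs(m) + 1) - lam, (k + 1) * (k + 2))
        k += 2
        coeffs.append(a)
    return coeffs

# l = 3, m = 1: lambda = l(l+1) = 12 and k = l - |m| = 2, so V_0 stops at mu^2.
terminating = series_coefficients(lam=12, m=1, start=0, count=5)

# lambda = 11 is not of the form l(l+1): the series never terminates.
non_terminating = series_coefficients(lam=11, m=1, start=0, count=5)

# The eigenvalues (55) for the first few values of l.
eigenvalues = [l * (l + 1) for l in range(5)]
```

In the terminating case the coefficients are 1, -5, 0, 0, 0, so that $V_0 = 1 - 5\mu^2$, a polynomial of degree $k = 2$.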

The well-behaved solutions of the $\theta$-equation. Since our interest
is restricted to well-behaved solutions, we may now rewrite (42) as

    $\left(\dfrac{d}{d\mu}\,(1 - \mu^2)\,\dfrac{d}{d\mu} + l(l + 1) - \dfrac{m^2}{1 - \mu^2}\right)\Theta = 0$    (56)

and (44) as

    $(1 - \mu^2)v'' - 2(|m| + 1)\mu v' + [l(l + 1) - |m| - m^2]v = 0.$    (57)

It follows from (54) that, for a preassigned $l$, only the following
$2l + 1$ values of $m$ yield polynomial solutions of (57):

    $m = -l, -(l - 1), \ldots, -1, 0, 1, \ldots, (l - 1), l.$    (58)

It can be shown³ that the polynomial solutions of (57) can be
written, apart from a multiplicative constant, as

    $v = \dfrac{d^{\,l+|m|}}{d\mu^{\,l+|m|}}\,(\mu^2 - 1)^l.$    (59)

According to (43), a well-behaved solution of (56) can be
obtained by multiplying (59) by $(1 - \mu^2)^{|m|/2}$; the result, when
divided by a conventional factor $2^l\,l!$, is denoted by $P_l^{|m|}$ and is
called an associated Legendre function of the first kind, except that
when $m = 0$ the symbol $P_l$ is used instead of $P_l^0$ and the resulting
function is called a Legendre polynomial. Thus, apart from an
arbitrary multiplicative constant, the well-behaved solutions of
(56) are

    $P_l^{|m|}(\mu) = \dfrac{1}{2^l\,l!}\,(1 - \mu^2)^{|m|/2}\,\dfrac{d^{\,l+|m|}}{d\mu^{\,l+|m|}}\,(\mu^2 - 1)^l.$    (60)

³ Jeans, §210.

For many quantum-mechanical purposes, however, it is
particularly convenient to introduce the following function,⁴ to be
denoted either by $\Theta(l\,m)$ or by $\Theta_l^m$:

    $\Theta(l\,m) = \Theta_l^m = (-1)^{(m+|m|)/2}\,\sqrt{\dfrac{(2l + 1)\,(l - |m|)!}{2\,(l + |m|)!}}\;P_l^{|m|},$    (61)

and to write the well-behaved solutions of (56) either as
$\Theta = A\Theta(l\,m)$ or as $\Theta = A\Theta_l^m$, where $A$ is an arbitrary nonvanishing
constant. We shall ordinarily denote the function (61) by $\Theta_l^m$,
and shall reserve the symbol $\Theta(l\,m)$ for cases that would
otherwise involve complicated typesetting.

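A few $\Theta_l^m$'s, grouped according to the values of $l$, are:

    $\Theta_0^0 = \sqrt{1/2}$

    $\Theta_1^0 = \sqrt{3/2}\,\mu = \sqrt{3/2}\,\cos\theta$
    $\Theta_1^1 = -\sqrt{3/4}\,(1 - \mu^2)^{1/2} = -\sqrt{3/4}\,\sin\theta = -\Theta_1^{-1}$

    $\Theta_2^0 = \sqrt{5/8}\,(3\mu^2 - 1) = \sqrt{5/8}\,(3\cos^2\theta - 1)$
    $\Theta_2^1 = -\sqrt{15/4}\,(1 - \mu^2)^{1/2}\mu = -\sqrt{15/4}\,\sin\theta\cos\theta = -\Theta_2^{-1}$
    $\Theta_2^2 = \sqrt{15/16}\,(1 - \mu^2) = \sqrt{15/16}\,\sin^2\theta = \Theta_2^{-2}$

    $\Theta_3^0 = \sqrt{7/8}\,(5\mu^3 - 3\mu) = \sqrt{7/8}\,(5\cos^3\theta - 3\cos\theta)$
    $\Theta_3^1 = -\sqrt{21/32}\,(1 - \mu^2)^{1/2}(5\mu^2 - 1) = -\sqrt{21/32}\,(5\sin\theta\cos^2\theta - \sin\theta) = -\Theta_3^{-1}$
    $\Theta_3^2 = \sqrt{105/16}\,(\mu - \mu^3) = \sqrt{105/16}\,\sin^2\theta\cos\theta = \Theta_3^{-2}$
    $\Theta_3^3 = -\sqrt{35/32}\,(1 - \mu^2)^{3/2} = -\sqrt{35/32}\,\sin^3\theta = -\Theta_3^{-3}$

⁴ Condon and Shortley, page 62.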

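The $\Theta_l^m$'s satisfy the equations

    $\int_{-1}^{1}\Theta_l^m(\mu)\,\Theta_{l'}^m(\mu)\,d\mu = \int_0^{\pi}\Theta_l^m(\cos\theta)\,\Theta_{l'}^m(\cos\theta)\,\sin\theta\,d\theta = \begin{cases} 1 & \text{if } l = l' \\ 0 & \text{if } l \neq l' \end{cases}$    (62)

and are therefore said to be 'orthonormal in $\theta$.'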
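The definitions (60) and (61) lend themselves to a direct numerical check of (62). The sketch below builds the polynomial $(\mu^2 - 1)^l$ as a coefficient list, differentiates it $l + |m|$ times, and integrates by the trapezoidal rule; the helper names and the number of grid points are illustrative assumptions.

```python
import math

def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists, lowest power first."""
    out = [0.0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def poly_diff(p):
    """Derivative of a polynomial given as a coefficient list."""
    return [i * c for i, c in enumerate(p)][1:]

def poly_eval(p, x):
    """Value of the polynomial at x (Horner's rule)."""
    total = 0.0
    for c in reversed(p):
        total = total * x + c
    return total

def theta(l, m, mu):
    """Theta_l^m(mu) built from (60) and (61); requires |m| <= l."""
    am = abs(m)
    p = [1.0]
    for _ in range(l):
        p = poly_mul(p, [-1.0, 0.0, 1.0])     # (mu^2 - 1)^l
    for _ in range(l + am):
        p = poly_diff(p)                      # derivative of order l + |m|
    base = max(0.0, 1.0 - mu * mu)            # guards against rounding at mu = +-1
    legendre = base ** (am / 2) * poly_eval(p, mu) / (2 ** l * math.factorial(l))
    norm = math.sqrt((2 * l + 1) * math.factorial(l - am) / (2 * math.factorial(l + am)))
    sign = (-1) ** ((m + am) // 2)
    return sign * norm * legendre

def overlap(l1, l2, m, n=2000):
    """Left side of (62), estimated by the trapezoidal rule in mu."""
    h = 2.0 / n
    total = 0.0
    for i in range(n + 1):
        mu = -1.0 + i * h
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * theta(l1, m, mu) * theta(l2, m, mu)
    return total * h

norm_31 = overlap(3, 3, 1)      # close to 1
cross_13 = overlap(1, 3, 1)     # close to 0
```

The same function `theta` can be used to extend a table of $\Theta$'s to larger $l$, though closed forms are more convenient for that purpose.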

The eigenfunctions of the Legendrian. We now recall (33),
multiply (61) by (38), and, for the moment, set $B = 0$. The
result, that is, the function

    $A\,\Theta_l^m\,e^{im\phi},$    (63)

is an eigenfunction of the Legendrian belonging to the eigenvalue
$l(l + 1)$.

Now, the value of $l$ having been fixed, the parameter $m$ in the
$\theta$-equation can be given any one of the $2l + 1$ values (58). The
products of the corresponding $\Theta$-factors (61) with the first term
in the $\Phi$-factor (38) are, if we omit the arbitrary multipliers,

    $\Theta_l^{-l}e^{-il\phi},\ \Theta_l^{-l+1}e^{-i(l-1)\phi},\ \ldots,\ \Theta_l^{-1}e^{-i\phi},\ \Theta_l^0,\ \Theta_l^1e^{i\phi},\ \ldots,\ \Theta_l^{l-1}e^{i(l-1)\phi},\ \Theta_l^l e^{il\phi}.$    (64)

Since $\Theta_l^{-m} = (-1)^m\Theta_l^m$, and since the arbitrary constants are
omitted, the $\Theta$'s with negative superscripts may be replaced by
the $\Theta$'s with the corresponding positive superscripts (this is not
unexpected, because the $\theta$-equation is unchanged when $m$ is
replaced by $-m$); but we shall adopt the notation of (64) on account
of its symmetry.

When we turn to the second term in the $\Phi$-factor (38) and
proceed as before, we find that, since this term differs from the
first only through the sign of $m$, it contributes nothing new to our
list (64). Hence the eigenfunctions of the Legendrian belonging
to the eigenvalue $l(l + 1)$ are just the $2l + 1$ functions (64),
their multiples, and their linear combinations. The members of
(64) are linearly independent, and hence the eigenvalue $l(l + 1)$
of the Legendrian is degenerate to the degree $2l + 1$.

It is convenient to define the symbols $Y(l\,m)$ and $Y_l^m$ by writing

    $Y(l\,m) = Y_l^m = \Theta(l\,m)\,\dfrac{e^{im\phi}}{\sqrt{2\pi}} = \Theta_l^m\,\dfrac{e^{im\phi}}{\sqrt{2\pi}}$    (65)

and to rewrite the list (64) as

    $Y_l^{-l},\ Y_l^{-l+1},\ \ldots,\ Y_l^{-1},\ Y_l^0,\ Y_l^1,\ \ldots,\ Y_l^{l-1},\ Y_l^l,$    (66)

the compensating factors $\sqrt{2\pi}$ being absorbed in the unwritten
arbitrary multipliers. The $Y$'s and their numerical multiples are
called surface spherical harmonics. Any well-behaved function
of $\theta$ and $\phi$ can be expressed as a linear combination of surface
spherical harmonics,⁵ and consequently our having restricted
ourselves to solutions of the Legendre equation of the rather special
form $u = \Theta\Phi$ has not impaired the generality of our results.

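A few $Y$'s, grouped according to the values of $l$, are:

    $Y_0^0 = \sqrt{\dfrac{1}{4\pi}}$

    $Y_1^{-1} = \sqrt{\dfrac{3}{8\pi}}\,\sin\theta\,e^{-i\phi}, \quad Y_1^0 = \sqrt{\dfrac{3}{4\pi}}\,\cos\theta, \quad Y_1^1 = -\sqrt{\dfrac{3}{8\pi}}\,\sin\theta\,e^{i\phi}$

    $Y_2^{-2} = \sqrt{\dfrac{15}{32\pi}}\,\sin^2\theta\,e^{-2i\phi}, \quad Y_2^{-1} = \sqrt{\dfrac{15}{8\pi}}\,\sin\theta\cos\theta\,e^{-i\phi}, \quad Y_2^0 = \sqrt{\dfrac{5}{16\pi}}\,(3\cos^2\theta - 1),$
    $Y_2^1 = -\sqrt{\dfrac{15}{8\pi}}\,\sin\theta\cos\theta\,e^{i\phi}, \quad Y_2^2 = \sqrt{\dfrac{15}{32\pi}}\,\sin^2\theta\,e^{2i\phi}$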


We finally note that, if $R$ is any well-behaved function of $r$,
then the function

    $u_{lm} = R\,Y_l^m$, where $l = 0, 1, 2, \ldots$ and $m = 0, \pm 1, \pm 2, \ldots, \pm l,$    (67)

is a simultaneous eigenfunction of the Legendrian and of the
operator $-i\,\partial/\partial\phi$, belonging to the eigenvalue $l(l + 1)$ of the
Legendrian and the eigenvalue $m$ of $-i\,\partial/\partial\phi$; conversely, every
simultaneous eigenfunction of the Legendrian and of $-i\,\partial/\partial\phi$ has
the form (67).
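The simultaneous-eigenfunction property can be verified numerically for a single harmonic. The sketch below applies the Legendrian (31) and the operator $-i\,\partial/\partial\phi$ to $Y_2^1$ by finite differences; the closed form used for $Y_2^1$ follows from (65), while the sample point and step sizes are arbitrary illustrative choices.

```python
import cmath
import math

def Y21(th, ph):
    """Y_2^1 = -sqrt(15 / (8 pi)) sin(theta) cos(theta) e^{i phi}."""
    return (-math.sqrt(15 / (8 * math.pi))
            * math.sin(th) * math.cos(th) * cmath.exp(1j * ph))

def legendrian(u, th, ph, h=1e-4):
    """Apply the operator (31) to u(theta, phi) using central differences."""
    def sin_du(t):
        # sin(theta) * du/dtheta evaluated at theta = t
        return math.sin(t) * (u(t + h, ph) - u(t - h, ph)) / (2 * h)
    theta_part = (sin_du(th + h) - sin_du(th - h)) / (2 * h) / math.sin(th)
    phi_part = ((u(th, ph + h) - 2 * u(th, ph) + u(th, ph - h))
                / (h * h) / math.sin(th) ** 2)
    return -theta_part - phi_part

th, ph = 0.9, 0.3

# Ratio (Legendrian u) / u: should be close to l(l+1) = 6.
ratio_legendrian = legendrian(Y21, th, ph) / Y21(th, ph)

# Ratio (-i du/dphi) / u: should be close to m = 1.
dY_dphi = (Y21(th, ph + 1e-6) - Y21(th, ph - 1e-6)) / 2e-6
ratio_phi = -1j * dY_dphi / Y21(th, ph)
```

Multiplying `Y21` by any well-behaved function of $r$ would leave both ratios unchanged, since neither operator involves $r$.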

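Exercises

1. Show that

    $\dfrac{\partial}{\partial x} = \sin\theta\cos\phi\,\dfrac{\partial}{\partial r} + \dfrac{1}{r}\cos\theta\cos\phi\,\dfrac{\partial}{\partial\theta} - \dfrac{\sin\phi}{r\sin\theta}\,\dfrac{\partial}{\partial\phi}$    (68)

    $\dfrac{\partial}{\partial y} = \sin\theta\sin\phi\,\dfrac{\partial}{\partial r} + \dfrac{1}{r}\cos\theta\sin\phi\,\dfrac{\partial}{\partial\theta} + \dfrac{\cos\phi}{r\sin\theta}\,\dfrac{\partial}{\partial\phi}$    (69)

    $\dfrac{\partial}{\partial z} = \cos\theta\,\dfrac{\partial}{\partial r} - \dfrac{1}{r}\sin\theta\,\dfrac{\partial}{\partial\theta}$    (70)

    $y\dfrac{\partial}{\partial z} - z\dfrac{\partial}{\partial y} = -\sin\phi\,\dfrac{\partial}{\partial\theta} - \cot\theta\cos\phi\,\dfrac{\partial}{\partial\phi}$    (71)

    $z\dfrac{\partial}{\partial x} - x\dfrac{\partial}{\partial z} = \cos\phi\,\dfrac{\partial}{\partial\theta} - \cot\theta\sin\phi\,\dfrac{\partial}{\partial\phi}$    (72)

    $x\dfrac{\partial}{\partial y} - y\dfrac{\partial}{\partial x} = \dfrac{\partial}{\partial\phi}$    (73)

    $\left(y\dfrac{\partial}{\partial z} - z\dfrac{\partial}{\partial y}\right)^2 + \left(z\dfrac{\partial}{\partial x} - x\dfrac{\partial}{\partial z}\right)^2 + \left(x\dfrac{\partial}{\partial y} - y\dfrac{\partial}{\partial x}\right)^2 = \dfrac{1}{\sin\theta}\,\dfrac{\partial}{\partial\theta}\,\sin\theta\,\dfrac{\partial}{\partial\theta} + \dfrac{1}{\sin^2\theta}\,\dfrac{\partial^2}{\partial\phi^2}.$    (74)

2. Show by comparing the Cartesian forms⁶ of the operators (71) and
(73) that the eigenvalues of $i\sin\phi\,\dfrac{\partial}{\partial\theta} + i\cot\theta\cos\phi\,\dfrac{\partial}{\partial\phi}$
are the integers (29).

⁶ That is, the forms given on the left sides of (71) and (73).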



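3. Let $[\alpha, \beta]$ denote the commutator $\alpha\beta - \beta\alpha$, and show that

    $\left[y\dfrac{\partial}{\partial z} - z\dfrac{\partial}{\partial y},\ z\dfrac{\partial}{\partial x} - x\dfrac{\partial}{\partial z}\right] = -\left(x\dfrac{\partial}{\partial y} - y\dfrac{\partial}{\partial x}\right)$    (75)

    $\left[z\dfrac{\partial}{\partial x} - x\dfrac{\partial}{\partial z},\ x\dfrac{\partial}{\partial y} - y\dfrac{\partial}{\partial x}\right] = -\left(y\dfrac{\partial}{\partial z} - z\dfrac{\partial}{\partial y}\right)$    (76)

    $\left[x\dfrac{\partial}{\partial y} - y\dfrac{\partial}{\partial x},\ y\dfrac{\partial}{\partial z} - z\dfrac{\partial}{\partial y}\right] = -\left(z\dfrac{\partial}{\partial x} - x\dfrac{\partial}{\partial z}\right).$    (77)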


4. Recall Exercise 7® and show that, if u is a simultaneous eigenfunction 
of any two of the operators (71), (72), and (73), then it is also an eigen- 
function of the third, and the eigenvalues of (71), (72), and (73) belonging 
to this u are all zero. 

5. Using first their Cartesian and then their polar forms, show in detail 
that each of the three operators (71), (72), and (73) commutes with (74). 
Note that this result is a special case of that of Exercise 8®. 

6. Using the Cartesian forms of the operators (71)⁶ to (74), show in
detail that each of these operators commutes with the operator
$r = \sqrt{x^2 + y^2 + z^2}$ and with every operator involving the operator $r$ only;
verify by inspecting the polar forms.

7. Use (61) and (60) to show that 

zTl C78) 


and that $\Theta_l^m$ is identically zero if $|m| > l$.

8. Extend the table of $\Theta$'s on pages 412 and 413 through $l = 4$.

9. Verify Equations 2-^ to 6-^ for sundry special cases using $\Theta$'s from the
table on pages 412 and 413.

10. Use Equations 3^ to 5^ to show that 


sinf?~er - - (Z + l)Brer-i. 

11. Show that if m 0 then 2^ follows from 3*^ , 4^ , and 6^. 

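12. Extend the table of $Y$'s on page 414 through $l = 3$.

13. Taking (62) for granted, show that the functions (65) form a set
that is orthonormal in the angles, that is,

    $\int_0^{2\pi}\int_0^{\pi}\bar Y_l^m\,Y_{l'}^{m'}\,\sin\theta\,d\theta\,d\phi = \begin{cases} 1 & \text{if } l = l' \text{ and } m = m' \\ 0 & \text{otherwise.} \end{cases}$    (79)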

75. Remarks on Classical Mechanics

The time derivatives $\dot x$, $\dot y$, and $\dot z$ of the Cartesian coordinates of
a particle are the Cartesian components $v_x$, $v_y$, and $v_z$ of its
velocity; the velocity itself is a vector, denoted by $\mathbf{v}$. The
quantities

    $p_x = m\dot x, \quad p_y = m\dot y, \quad p_z = m\dot z$    (1)

(where $m$ is the mass of the particle), are the components of the
linear momentum of the particle; the linear momentum itself is a
vector, denoted by $\mathbf{p}$. In vector notation, the three equations (1)
are summarized by writing

    $\mathbf{p} = m\dot{\mathbf{r}},$    (2)

where $\dot{\mathbf{r}}$ is the time derivative of the position vector $\mathbf{r}$.

[Fig. 102]

If we consider Fig. 102 and compute the 'moment' of the
vector $\mathbf{p}$ about the $x$-axis (in the manner used in statics to
compute the moment of a force about an axis), we find the result
to be $yp_z - zp_y$; similarly, the respective moments of $\mathbf{p}$ about
the $y$-axis and about the $z$-axis are $zp_x - xp_z$ and $xp_y - yp_x$. We write

    $L_x = yp_z - zp_y, \quad L_y = zp_x - xp_z, \quad L_z = xp_y - yp_x,$    (3)

and call $L_x$, $L_y$, and $L_z$ the components of the orbital angular
momentum of the particle about the point $O$. The orbital angular
momentum itself is a vector, denoted by $\mathbf{L}$; its magnitude is
denoted by $L$, so that

    $L^2 = L_x^2 + L_y^2 + L_z^2.$    (4)

Orbital angular momentum is often called by the more descriptive
name moment of momentum. In vector notation, the three
equations (3) are summarized by means of the equation

    $\mathbf{L} = \mathbf{r} \times \mathbf{p},$    (5)

which is read "L equals r cross p."
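The component formulas (3) are just the components of the cross product (5). A minimal sketch, with an arbitrarily chosen mass, position, and velocity (none of which come from the text):

```python
def cross(a, b):
    """Cartesian components of the vector product a x b, as in (3)."""
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by,   # x-component: y p_z - z p_y
            az * bx - ax * bz,   # y-component: z p_x - x p_z
            ax * by - ay * bx)   # z-component: x p_y - y p_x

m = 2.0                       # hypothetical mass
r = (1.0, 2.0, 3.0)           # hypothetical position vector
v = (0.5, -1.0, 4.0)          # hypothetical velocity

p = tuple(m * c for c in v)           # equations (1) and (2)
L = cross(r, p)                       # equation (5)
L_squared = sum(c * c for c in L)     # equation (4)

# L is perpendicular to both r and p, as a moment of momentum should be.
L_dot_r = sum(a * b for a, b in zip(L, r))
L_dot_p = sum(a * b for a, b in zip(L, p))
```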

Let the force (a vector) acting on the particle when the particle
is at⁷ $xyz$ be denoted by $\mathbf{f}(x, y, z)$, or simply by $\mathbf{f}$, and let its
Cartesian components be denoted by $f_x(x, y, z)$, $f_y(x, y, z)$, and
$f_z(x, y, z)$, or simply by $f_x$, $f_y$, and $f_z$. The classical equations
of motion of the particle then are

    $m\ddot x = f_x, \quad m\ddot y = f_y, \quad m\ddot z = f_z.$    (6)

In vector notation, the three equations (6) are summarized by
writing

    $m\ddot{\mathbf{r}} = \mathbf{f}.$

⁷ We exclude from consideration the case when the force depends on
the velocity of the particle, as it does, for example, when an electrically
charged particle moves in a magnetic field.

If there exists a function of $x$, $y$, and $z$, which we denote by
$V(x, y, z)$, or simply by $V$, such that

    $f_x = -\dfrac{\partial V}{\partial x}, \quad f_y = -\dfrac{\partial V}{\partial y}, \quad f_z = -\dfrac{\partial V}{\partial z},$    (7)

then $V$ is called the potential function of the system, or simply the
potential, and the force field is said to be conservative. In vector
notation, the three equations (7) are summarized by means of the
equation

    $\mathbf{f} = -\operatorname{grad} V,$    (8)

which is read "f equals minus the gradient of V."

Whenever Equations (7) hold, the quantity

    $\tfrac{1}{2}m(\dot x^2 + \dot y^2 + \dot z^2) + V(x, y, z)$    (9)

remains constant as the particle proceeds with its motion. This
quantity is called the energy of the particle and is denoted by $E$;
the first term in (9) is the kinetic energy of the particle, while
$V(x, y, z)$ is its potential energy. If (9) is expressed in terms of
$x$, $y$, $z$, $p_x$, $p_y$, and $p_z$ (rather than in terms of $x$, $y$, $z$, $\dot x$, $\dot y$, and $\dot z$)
the result is called the Hamiltonian function of the system, or
simply the Hamiltonian of the system, and is denoted by $H$:

    $H = \dfrac{1}{2m}(p_x^2 + p_y^2 + p_z^2) + V(x, y, z).$    (10)

To indicate that the energy of the system preserves its numerical
value $E$, we write

    $H = E.$    (11)
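The constancy of (9) under the equations of motion (6) can be observed numerically. The sketch below integrates the motion in a hypothetical conservative field; the potential, the initial conditions, the time step, and the choice of the velocity Verlet scheme are all assumptions made for illustration.

```python
def potential(x, y, z):
    """Hypothetical potential V = (x^2 + 2 y^2 + 3 z^2) / 2."""
    return 0.5 * (x * x + 2.0 * y * y + 3.0 * z * z)

def force(x, y, z):
    """Components of f = -grad V, as in (7), for the potential above."""
    return (-x, -2.0 * y, -3.0 * z)

def energy(pos, vel, m):
    """The quantity (9): kinetic plus potential energy."""
    return 0.5 * m * sum(c * c for c in vel) + potential(*pos)

def simulate(pos, vel, m=1.0, dt=1e-3, steps=5000):
    """Integrate m * r'' = f with the velocity Verlet scheme, recording the energy."""
    energies = [energy(pos, vel, m)]
    f = force(*pos)
    for _ in range(steps):
        vel = [v + 0.5 * dt * fi / m for v, fi in zip(vel, f)]   # half kick
        pos = [q + dt * v for q, v in zip(pos, vel)]             # drift
        f = force(*pos)
        vel = [v + 0.5 * dt * fi / m for v, fi in zip(vel, f)]   # half kick
        energies.append(energy(pos, vel, m))
    return energies

energies = simulate([1.0, 0.0, -0.5], [0.0, 0.7, 0.2])
drift = max(abs(e - energies[0]) for e in energies) / energies[0]
```

The relative drift is not exactly zero, because the integration is approximate, but it stays small and shrinks as the time step is reduced.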



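The set (6) of three simultaneous second-order differential
equations is equivalent to the following set of six first-order
differential equations, called Hamilton's canonical equations:

    $\dot x = \dfrac{\partial H}{\partial p_x}, \quad \dot y = \dfrac{\partial H}{\partial p_y}, \quad \dot z = \dfrac{\partial H}{\partial p_z}, \quad \dot p_x = -\dfrac{\partial H}{\partial x}, \quad \dot p_y = -\dfrac{\partial H}{\partial y}, \quad \dot p_z = -\dfrac{\partial H}{\partial z}.$    (12)

If the six variables $q_1$, $q_2$, $q_3$, $p_1$, $p_2$, and $p_3$ are such that $H$
can be expressed entirely in terms of them and that when this is
done the six equations

    $\dot q_i = \dfrac{\partial H}{\partial p_i}, \quad \dot p_i = -\dfrac{\partial H}{\partial q_i}, \quad i = 1, 2, 3$    (13)

hold, then $q_i$ and $p_i$ are said to be canonically conjugate.
Comparison of (13) with (12) shows that, in particular, the coordinate
$x$ and the $x$-component $p_x$ of the linear momentum are canonically
conjugate, that so are $y$ and $p_y$, and that so are $z$ and $p_z$.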

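The dynamical variable denoted by $[\xi, \eta]$ and constructed from
two dynamical variables $\xi$ and $\eta$ according to the formula

$$ [\xi, \eta] = \left(\frac{\partial \xi}{\partial x}\frac{\partial \eta}{\partial p_x} - \frac{\partial \xi}{\partial p_x}\frac{\partial \eta}{\partial x}\right) + \left(\frac{\partial \xi}{\partial y}\frac{\partial \eta}{\partial p_y} - \frac{\partial \xi}{\partial p_y}\frac{\partial \eta}{\partial y}\right) + \left(\frac{\partial \xi}{\partial z}\frac{\partial \eta}{\partial p_z} - \frac{\partial \xi}{\partial p_z}\frac{\partial \eta}{\partial z}\right) \tag{14} $$

is called the Poisson bracket of $\xi$ and $\eta$. The Poisson brackets of
some of the simpler dynamical variables are listed in Table 17.
We note, in particular, that

$$ [L_x, L_y] = L_z, \qquad [L_y, L_z] = L_x, \qquad [L_z, L_x] = L_y. \tag{15} $$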
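Formula (14) lends itself to a quick numerical spot check. The sketch below is an illustration under stated assumptions rather than part of the text: the partial derivatives are replaced by central differences, the dynamical variables are ordinary Python functions of (x, y, z, px, py, pz), and the sample point `state` is arbitrary. The function names are hypothetical.

```python
# Numerical Poisson bracket following formula (14), with the partial
# derivatives approximated by central differences (an assumption of this
# sketch; the differences are exact up to rounding for the low-degree
# polynomials used here).

def poisson_bracket(xi, eta, state, h=1e-4):
    """Return [xi, eta] at state = (x, y, z, px, py, pz)."""
    def partial(f, i):
        up = list(state)
        down = list(state)
        up[i] += h
        down[i] -= h
        return (f(*up) - f(*down)) / (2.0 * h)

    total = 0.0
    for i in range(3):  # i = 0, 1, 2 for x, y, z; i + 3 is the conjugate momentum
        total += partial(xi, i) * partial(eta, i + 3)
        total -= partial(xi, i + 3) * partial(eta, i)
    return total

def Lx(x, y, z, px, py, pz):
    return y * pz - z * py

def Ly(x, y, z, px, py, pz):
    return z * px - x * pz

def Lz(x, y, z, px, py, pz):
    return x * py - y * px

state = (0.3, -1.2, 0.7, 0.5, 0.9, -0.4)  # arbitrary sample point
print(poisson_bracket(Lx, Ly, state), Lz(*state))  # the two numbers should agree
```

Evaluating `poisson_bracket` for other pairs of variables at a few sample points gives a simple way of checking individual entries of Table 17.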

Distribution functions. If the information available about a 
particle moving in three dimensions falls short of being equiva- 
lent to the specification of the precise initial conditions of the 
motion (of which there are six), then a study of the motion usually 
involves probability considerations. The following remarks are 
restricted to probabilities concerning the position of a particle; 
for definiteness, we write down our definitions in terms of a 
Cartesian frame. 





Table 17

Examples of Poisson brackets. [ξ, η] is shown at the intersection of the
row labeled ξ and the column labeled η.

        x      y      z      p_x    p_y    p_z    L_x    L_y    L_z
x       0      0      0      1      0      0      0      z      -y
y       0      0      0      0      1      0      -z     0      x
z       0      0      0      0      0      1      y      -x     0
p_x     -1     0      0      0      0      0      0      p_z    -p_y
p_y     0      -1     0      0      0      0      -p_z   0      p_x
p_z     0      0      -1     0      0      0      p_y    -p_x   0
L_x     0      z      -y     0      p_z    -p_y   0      L_z    -L_y
L_y     -z     0      x      -p_z   0      p_x    -L_z   0      L_x
L_z     y      -x     0      p_y    -p_x   0      L_y    -L_x   0

If the x-, the y-, and the z-coordinate of the particle have
values lying, respectively, somewhere between x and x + dx,
somewhere between y and y + dy, and somewhere between z and
z + dz, then we say for short that the particle is “in dxdydz at
xyz.” If the probability that the particle is in dxdydz at xyz is
expressed in the form

$$ P(x, y, z)\, dx\, dy\, dz, \tag{16} $$

we call P(x, y, z) the (absolute)⁷ distribution function for position,
the distribution-in-position, the distribution-in-xyz, or the
probability density. The probability that the particle is inside a
given volume is computed by integrating P(x, y, z) over this volume;
since the particle is certain to be somewhere in the infinite region,
we have, in particular,

$$ \iiint P(x, y, z)\, dx\, dy\, dz = 1. \tag{17} $$

The (expected) average, av f(x, y, z), and the (expected) inexactitude,
Δf(x, y, z), of a function f(x, y, z) are defined as follows:

$$ \operatorname{av} f(x, y, z) = \iiint f(x, y, z) P(x, y, z)\, dx\, dy\, dz \tag{18} $$

$$ \Delta f(x, y, z) = \sqrt{\iiint \left[f(x, y, z) - \operatorname{av} f(x, y, z)\right]^2 P(x, y, z)\, dx\, dy\, dz}. \tag{19} $$

⁷ Relative probabilities can be introduced in the manner of §11.









We note that 


$$ \Delta^2 f(x, y, z) = \operatorname{av}\,[f(x, y, z)]^2 - \operatorname{av}^2 f(x, y, z). \tag{20} $$

The formulas (18) to (20) apply of course also in the cases when
the function denoted by f(x, y, z) depends in fact only on x, or
only on y, or only on x and y, and so on.

If the distribution-in-xyz depends on the time t, and thus has
the form P(x, y, z, t), it is called nonstationary; otherwise it is
called stationary.

If the distribution-in-xyz vanishes or nearly vanishes everywhere
except in a limited region, it is sometimes called a probability
packet; the point with coordinates av x, av y, and av z is then
called the center of the packet. The distribution function

$$ P(x, y, z) = \frac{1}{\sigma_x \sigma_y \sigma_z (2\pi)^{3/2}} \exp\left[-\frac{(x - x_0)^2}{2\sigma_x^2} - \frac{(y - y_0)^2}{2\sigma_y^2} - \frac{(z - z_0)^2}{2\sigma_z^2}\right], \tag{21} $$

where $x_0$, $y_0$, $z_0$, and the σ’s are constants, is an example of a
stationary probability packet. The center of this packet is at
the point $x_0 y_0 z_0$; should one or more of the quantities $x_0$, $y_0$,
and $z_0$ depend on t, the packet would be a moving one.

Certain distribution functions are conveniently expressed in
terms of the polar frame. For example, if the parameters in (21)
satisfy the relations $x_0 = y_0 = z_0 = 0$ and $\sigma_x = \sigma_y = \sigma_z = \sigma$,
then (21) becomes $P(x, y, z) = [\sigma^3 (2\pi)^{3/2}]^{-1} \exp(-r^2/2\sigma^2)$, so that
the probability density at the point rθφ is

$$ P(r, \theta, \phi) = \frac{1}{\sigma^3 (2\pi)^{3/2}}\, e^{-r^2/2\sigma^2}; \tag{22} $$

the probability of finding the particle in the polar volume element
$r^2\, dr \sin\theta\, d\theta\, d\phi$ at the point rθφ is consequently

$$ P(r, \theta, \phi)\, r^2\, dr \sin\theta\, d\theta\, d\phi = \frac{1}{\sigma^3 (2\pi)^{3/2}}\, e^{-r^2/2\sigma^2}\, r^2\, dr \sin\theta\, d\theta\, d\phi. \tag{23} $$

The distribution (22) is seen to be spherically symmetric, that is,
independent of the angles θ and φ; we denote it nevertheless by
P(r, θ, φ), rather than simply by P(r), in order to emphasize the
fact that it does contain information about the angles as well as
about r.





The probability that the distance of the particle from the
origin has a value lying between r and r + dr (that is, the
probability of finding the particle between two spheres with radii r
and r + dr and with centers at O) is often of interest; if this
probability is expressed in the form $P_r\, dr$, then $P_r$, a function
of r, is the distribution-in-r. We have, by definition,

$$ P_r\, dr = \int_{\phi=0}^{2\pi} \int_{\theta=0}^{\pi} \int_{r}^{r+dr} P(r, \theta, \phi)\, r^2\, dr \sin\theta\, d\theta\, d\phi \tag{24} $$

or simply (see 10^),

$$ P_r\, dr = \left[ r^2 \int_{0}^{2\pi} \int_{0}^{\pi} P(r, \theta, \phi) \sin\theta\, d\theta\, d\phi \right] dr. \tag{25} $$

For example, in the special case (22), which is particularly simple
because of the spherical symmetry, we have

$$ P_r = \frac{4\pi}{\sigma^3 (2\pi)^{3/2}}\, r^2 e^{-r^2/2\sigma^2}. \tag{26} $$

The reason for our denoting the distribution-in-r by $P_r$ rather
than by P(r) is to diminish the possibility of confusing the symbol
for this distribution with P(r, θ, φ), that is, with our symbol for
the polar form of the probability density.
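The radial distribution of the spherically symmetric Gaussian packet, P_r = 4π r² exp(−r²/2σ²) / [σ³(2π)^{3/2}], can be checked numerically. The sketch below is an illustration only: it integrates this P_r with a simple midpoint rule and evaluates av r and Δr from the definitions of the expected average and the inexactitude. The value of σ, the cutoff at 12σ, and the number of subintervals are arbitrary assumptions.

```python
import math

def radial_distribution(r, sigma):
    """P_r for the spherically symmetric Gaussian packet of width sigma."""
    norm = 4.0 * math.pi / (sigma ** 3 * (2.0 * math.pi) ** 1.5)
    return norm * r * r * math.exp(-r * r / (2.0 * sigma * sigma))

def integrate(f, a, b, n=20000):
    """Midpoint rule on [a, b] with n subintervals."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

sigma = 1.5            # assumed packet width
upper = 12.0 * sigma   # beyond this radius the integrand is negligible

total = integrate(lambda r: radial_distribution(r, sigma), 0.0, upper)
av_r = integrate(lambda r: r * radial_distribution(r, sigma), 0.0, upper)
av_r2 = integrate(lambda r: r * r * radial_distribution(r, sigma), 0.0, upper)
inexactitude_r = math.sqrt(av_r2 - av_r ** 2)  # square root of av r^2 - (av r)^2

print("total probability:", total)
print("av r:", av_r, " exact value 2*sigma*sqrt(2/pi):", 2.0 * sigma * math.sqrt(2.0 / math.pi))
print("inexactitude of r:", inexactitude_r)
```

The closed forms quoted in the printout (total probability 1, av r = 2σ√(2/π)) are standard Gaussian integrals; they are not stated in the text and are included only for comparison.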

Probability currents. If the distribution-in-xyz is nonstationary
and thus has the form P(x, y, z, t), we may speak of it
as flowing from place to place and may introduce the notion of
probability currents, or, more precisely, probability-current
densities, as follows: Let dA be a small area located at the point xyz
and parallel to the yz-plane; the time rate at which the probability
flows in the positive x-direction across this area at the instant t
is then proportional to dA and is in general dependent on x, y, z,
and t, so that we may express it in the form $s_x(x, y, z, t)\, dA$; the
function $s_x(x, y, z, t)$, or simply $s_x$, is then called the x-component
of the probability-current density at xyz at the instant t; it is
negative whenever the flow is in the negative x-direction. The
y- and the z-component of this current density are defined in a
similar manner and are denoted by $s_y(x, y, z, t)$ and $s_z(x, y, z, t)$,
respectively. The current density itself is now defined as the
vector whose components are $s_x$, $s_y$, and $s_z$, and is
denoted by s(x, y, z, t) or simply by s. The direction of s at any





point is that of the total current at this point, and the magnitude 
of s at any point is that of the total current density at this point. 

Since probability is, so to speak, neither created nor destroyed,
the time rate at which the probability of finding the particle in a
given volume increases should equal the time rate at which the
probability enters this volume across its surface (if the probability
of finding the particle within the volume actually decreases, then
the rate at which it increases is negative, and probability leaves
rather than enters across the surface). To express this conservation
property in mathematical terms, we consider the volume
element dxdydz about the point xyz, shown in Fig. 103. The
probability of finding the particle in this volume is
P(x, y, z, t) × dxdydz or simply P dxdydz, and the time rate at which
this probability varies is $\frac{\partial P}{\partial t}\, dxdydz$.
The rate at which probability flows into the volume
through the shaded face marked A is $s_x(x, y, z, t)$ × dydz or simply
$s_x\, dydz$; the rate at which probability leaves
the volume through the shaded face B is $s_x(x + dx, y, z, t)\, dydz$
or [when terms in $dx^2$, $dx^3$, and so on, are omitted]
$s_x(x, y, z, t)\, dydz + \frac{\partial s_x}{\partial x}\, dxdydz$, that is,
$s_x\, dydz + \frac{\partial s_x}{\partial x}\, dxdydz$; the rate of net
influx through the two shaded faces is consequently
$-\frac{\partial s_x}{\partial x}\, dxdydz$.
The two remaining pairs of faces contribute to the net influx the
terms $-\frac{\partial s_y}{\partial y}\, dxdydz$ and
$-\frac{\partial s_z}{\partial z}\, dxdydz$. Equating the total influx to
$\frac{\partial P}{\partial t}\, dxdydz$ and canceling the factor dxdydz,
we now get the so-called continuity equation

$$ \frac{\partial s_x}{\partial x} + \frac{\partial s_y}{\partial y} + \frac{\partial s_z}{\partial z} = -\frac{\partial P}{\partial t}, \tag{27} $$

[Fig. 103]


which summarizes the conservation of probability in mathematical
terms. In vector notation, Equation (27) is written as

$$ \operatorname{div} \mathbf{s} = -\frac{\partial P}{\partial t} \tag{28} $$

and is read “the divergence of s equals $-\partial P/\partial t$.”
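As a simple illustration of the continuity equation, consider a packet of fixed shape that drifts along the x-axis with a constant speed v. Both the packet and the form of the current used here, s = vP directed along x, are assumptions made for this sketch and are not taken from the text; g is an arbitrary differentiable function and g₁ denotes its derivative with respect to its first argument.

```latex
% Hypothetical example: a rigidly drifting packet and the assumed current s = vP.
\begin{align*}
P(x, y, z, t) &= g(x - vt,\, y,\, z), \qquad s_x = vP, \quad s_y = s_z = 0, \\
\frac{\partial s_x}{\partial x} &= v\, g_1(x - vt,\, y,\, z), \qquad
\frac{\partial P}{\partial t} = -v\, g_1(x - vt,\, y,\, z), \\
\operatorname{div} \mathbf{s} &= \frac{\partial s_x}{\partial x} = -\frac{\partial P}{\partial t}.
\end{align*}
```

In this example the probability that leaves any volume element through one face is exactly what the drifting packet carries across that face, which is the content of the conservation statement.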

Plane complex waves. If $k_x$, $k_y$, $k_z$, and c are real constants,
then the equation

$$ k_x x + k_y y + k_z z = c \tag{29} $$

is the equation of a plane. We shall leave it to the reader to
verify that the x-, the y-, and the z-intercept of the plane (29)
are $c/k_x$, $c/k_y$, and $c/k_z$; that the x-, the y-, and the z-coordinate
of the point in this plane lying nearest to O are $k_x c/(k_x^2 + k_y^2 + k_z^2)$,
$k_y c/(k_x^2 + k_y^2 + k_z^2)$, and $k_z c/(k_x^2 + k_y^2 + k_z^2)$; that the distance of
this plane from O is $|c|/(k_x^2 + k_y^2 + k_z^2)^{1/2}$; that the cosines of the
angles $\alpha_x$, $\alpha_y$, and $\alpha_z$ which a normal to this plane makes with
the x-, the y-, and the z-axis are

$$ \cos\alpha_x = \frac{k_x}{\sqrt{k_x^2 + k_y^2 + k_z^2}}, \qquad \cos\alpha_y = \frac{k_y}{\sqrt{k_x^2 + k_y^2 + k_z^2}}, \qquad \cos\alpha_z = \frac{k_z}{\sqrt{k_x^2 + k_y^2 + k_z^2}}; \tag{30} $$

and that the plane

$$ k_x x + k_y y + k_z z = c' \tag{31} $$

is parallel to the plane (29) and lies a distance
$|c - c'|/(k_x^2 + k_y^2 + k_z^2)^{1/2}$ from it.

Let k denote the vector with components $k_x$, $k_y$, and $k_z$; the
cosines of the angles that this vector makes with the three axes
are just (30), and consequently the plane (29) is perpendicular to
the vector k. In vector notation, (29) is written as

$$ \mathbf{k} \cdot \mathbf{r} = c \tag{32} $$

and is read “k dot r equals c.” The magnitude of k will be
denoted by k, so that $k = \sqrt{k_x^2 + k_y^2 + k_z^2}$.

If t is the time, A is a constant (real, imaginary, or complex),
ν is a positive constant, and $k_x$, $k_y$, and $k_z$ are real constants,
then the expression


$$ u = A e^{2\pi i (k_x x + k_y y + k_z z - \nu t)} \tag{33} $$

or, in vector notation,

$$ u = A e^{2\pi i (\mathbf{k} \cdot \mathbf{r} - \nu t)} \tag{34} $$

is called a plane monochromatic complex wave. This wave is said
to be plane because, at any fixed instant, u has a constant value
over any one plane perpendicular to the vector k. A is the
(complex) amplitude of u; ν is the frequency of u [note that at
any fixed point in space u has the form constant × $e^{-2\pi i \nu t}$]; the
quantity

$$ \lambda = \frac{1}{\sqrt{k_x^2 + k_y^2 + k_z^2}} = \frac{1}{k} \tag{35} $$

is the wave length of u [note that if the distance between two planes,
both of which are perpendicular to k, is an integral multiple of
(35), then the value of u at any instant is the same over both
planes], and k is the wave number of u. As time goes on, the
values of u drift, so to speak, in the direction of the vector k
[which for this reason is called the propagation vector] with the
velocity

$$ v = \lambda \nu, \tag{36} $$

called the phase velocity of u.
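The two bracketed remarks about the wave length and the phase velocity can be checked numerically. The sketch below assumes the plane wave in the form u = A exp[2πi(k·r − νt)], with the wave length 1/k and the phase velocity equal to wave length times frequency; the particular k, ν, point r, and instants used are arbitrary choices, and the function name is hypothetical.

```python
import cmath
import math

def plane_wave(r, t, k, nu, amplitude=1.0 + 0.0j):
    """u = A exp[2*pi*i*(k.r - nu*t)] for 3-component tuples r and k."""
    phase = sum(ki * ri for ki, ri in zip(k, r)) - nu * t
    return amplitude * cmath.exp(2j * math.pi * phase)

k = (3.0, -2.0, 6.0)                      # assumed propagation vector, magnitude 7
nu = 5.0                                  # assumed frequency
k_mag = math.sqrt(sum(ki * ki for ki in k))
wavelength = 1.0 / k_mag
phase_velocity = wavelength * nu
unit = tuple(ki / k_mag for ki in k)      # unit vector along k

r = (0.2, 0.5, -0.1)
t = 0.37
u_here = plane_wave(r, t, k, nu)

# A displacement of one wave length along k leaves u unchanged.
shifted = tuple(ri + wavelength * ui for ri, ui in zip(r, unit))
u_shifted = plane_wave(shifted, t, k, nu)

# Moving along k with the phase velocity for a time dt also leaves u unchanged.
dt = 0.01
moved = tuple(ri + phase_velocity * dt * ui for ri, ui in zip(r, unit))
u_moved = plane_wave(moved, t + dt, k, nu)

print(abs(u_shifted - u_here), abs(u_moved - u_here))  # both differences should be tiny
```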

The plane monochromatic waves encountered in the classical 
theory of the vibrations of continuous media are real waves, 
expressible in the form⁸

$$ u = \operatorname{Re} A e^{2\pi i (\mathbf{k} \cdot \mathbf{r} - \nu t)}, \tag{37} $$

where the symbol Re is that of 2®. 


Exercises 

1. Verify Table 17. 

2. Show that Equation 16’, that is, $\dot{\xi} = [\xi, H]$, holds in
three-dimensional motion whenever ξ does not involve t explicitly. If
$\dot{\xi} = 0$ for all time, then ξ is called a constant of motion of
the system.


⁸ If A in (37) is to be the true amplitude of the real wave, then it must
be required to be real, and a further real constant, the initial phase φ,
must be added within the parentheses of the exponent of (37). See 16*®.





3. Show that Equations 14’ hold in the three-dimensional case. 

4. Show that, if f(r) is an arbitrary function of r and f(p) is an arbitrary
function of p, where $p = \sqrt{p_x^2 + p_y^2 + p_z^2}$, then

$$ [L_x, f(r)] = [L_y, f(r)] = [L_z, f(r)] = [L^2, f(r)] = 0 \tag{38} $$

$$ [L_x, f(p)] = [L_y, f(p)] = [L_z, f(p)] = [L^2, f(p)] = 0. \tag{39} $$

5. The quantity $m\dot{r}$, denoted by $p_r$, is called radial momentum. Show
that, when the force field is conservative, the energy of a particle can be
expressed in the form

$$ \frac{p_r^2}{2m} + \frac{L^2}{2mr^2} + V(r, \theta, \phi), $$

where V(r, θ, φ) is the potential energy expressed in terms of polar
coordinates.

6. If the force acting on the particle is everywhere directed either toward
or away from the origin O, then the force field is called central or radial.
Show that, if the potential energy of the particle depends only on the
distance from O, that is, if the potential function has the special form
V = V(r), then the force field is central. Show also that, if the force
field is central and conservative, then the potential function has the form
V = V(r).

7. Show that, if the force field is central and conservative, then $L_x$,
$L_y$, $L_z$, and $L^2$ are constants of motion.

8. Show that some of the constants of motion of a free particle moving
in three dimensions are H, $p_x$, $p_y$, $p_z$, $L_x$, $L_y$, $L_z$, and $L^2$.

9. A particle attracted toward O with a force proportional to its
distance r from O is called an isotropic spatial harmonic oscillator, or simply
an isotropic oscillator; its potential function is
$V(x, y, z) = \tfrac{1}{2}k(x^2 + y^2 + z^2)$, or simply $V(r) = \tfrac{1}{2}kr^2$,
where k is the restoring constant. Show that the
quantities H, $L_x$, $L_y$, $L_z$, and $L^2$, and also the quantities
$H^x = p_x^2/2m + \tfrac{1}{2}kx^2$, $H^y = p_y^2/2m + \tfrac{1}{2}ky^2$, and
$H^z = p_z^2/2m + \tfrac{1}{2}kz^2$ are each a
constant of motion of the isotropic oscillator.

10. Verify that for the distribution (21) av $x = x_0$, av $y = y_0$, av $z = z_0$,
$\Delta x = \sigma_x$, $\Delta y = \sigma_y$, and $\Delta z = \sigma_z$.

11. Suppose that a particle confined inside a sphere of radius R with
center at O is equally likely to be anywhere within this sphere, and show
that then av $r = \tfrac{3}{4}R$, $\Delta r = \sqrt{3/80}\, R$, av $x = 0$, and
$\Delta x = R/\sqrt{5}$.

12. Show that the function (33), a plane wave, is a simultaneous
eigenfunction of the operators $-i\,\partial/\partial x$, $-i\,\partial/\partial y$,
$-i\,\partial/\partial z$, and $-\nabla^2$, belonging to the respective
eigenvalues $2\pi k_x$, $2\pi k_y$, $2\pi k_z$, and $4\pi^2(k_x^2 + k_y^2 + k_z^2)$
of these operators.

76. Elements of the Schroedinger Method 

According to the operator assumption I of §13, the properties 
of dynamical variables pertaining to three-dimensional motion are 





investigated in quantum mechanics through a mathematical study
of the properties of operators to be associated with these variables.
These operators are required to satisfy the quantum condition

$$ \alpha\beta - \beta\alpha = i\hbar[\alpha, \beta], \tag{1} $$

the formula for the Poisson bracket now being $14^{75}$. It follows
from (1) and Table 17 that the operators x, y, z, $p_x$, $p_y$, and
$p_z$ (whatever their specific forms may be), associated in any
quantum-mechanical scheme with the coordinates and the components
of momentum of a particle, must satisfy the following
commutation rules: the operators x, y, and z must commute with
one another; the operators $p_x$, $p_y$, and $p_z$ must commute with
one another; and we must have

$$ xp_x - p_x x = i\hbar, \qquad yp_y - p_y y = i\hbar, \qquad zp_z - p_z z = i\hbar \tag{2} $$

$$ yp_x - p_x y = 0, \qquad zp_y - p_y z = 0, \qquad xp_z - p_z x = 0 \tag{3} $$

$$ zp_x - p_x z = 0, \qquad xp_y - p_y x = 0, \qquad yp_z - p_z y = 0. \tag{4} $$

Schroedinger operators. The Schroedinger method employs 

the following particular scheme of associating specific operators 
with the fundamental dynamical variables: 


dynamical variables x, y, z → operators x, y, z, respectively;

dynamical variables $p_x$, $p_y$, $p_z$ → operators
$-i\hbar\,\partial/\partial x$, $-i\hbar\,\partial/\partial y$, $-i\hbar\,\partial/\partial z$,
respectively. (5)

This scheme is a direct extension of the one-dimensional scheme 3^^.
The operators (5) satisfy the requisite commutation rules, and
their eigenvalues are all real.
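That the Schroedinger operators satisfy the commutation rules can be seen by letting both sides act on an arbitrary differentiable operand ψ. The following worked lines are a sketch of that check for one commuting and one noncommuting pair, taking the momentum operator to be −iħ ∂/∂x as in the scheme (5); the remaining pairs go through in the same way.

```latex
\begin{align*}
(x p_x - p_x x)\psi
  &= x\left(-i\hbar \frac{\partial \psi}{\partial x}\right)
     + i\hbar \frac{\partial}{\partial x}(x\psi) \\
  &= -i\hbar\, x \frac{\partial \psi}{\partial x} + i\hbar\, \psi
     + i\hbar\, x \frac{\partial \psi}{\partial x}
   = i\hbar\, \psi, \\
(y p_x - p_x y)\psi
  &= -i\hbar\, y \frac{\partial \psi}{\partial x}
     + i\hbar \frac{\partial}{\partial x}(y\psi) = 0.
\end{align*}
```

The second line vanishes because y is a constant as far as differentiation with respect to x is concerned.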

A further Schroedinger association, of which the associations 
(5) are special cases, is 


$$ \text{dynamical variable } \alpha(x, y, z) + \beta(p_x, p_y, p_z) \;\rightarrow\; \text{operator } \alpha(x, y, z) + \beta\!\left(-i\hbar\frac{\partial}{\partial x},\, -i\hbar\frac{\partial}{\partial y},\, -i\hbar\frac{\partial}{\partial z}\right). \tag{6} $$

The respective operators to be associated with dynamical
variables of the most general form $\alpha(x, y, z, p_x, p_y, p_z)$, involving
products of the coordinates and the p’s, must usually be each
identified separately because of the noncommutability of x and $p_x$,
and so forth.

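According to (6), the Schroedinger operator associated with a
Hamiltonian

$$ (10^{75}) \qquad H = \frac{1}{2m}(p_x^2 + p_y^2 + p_z^2) + V(x, y, z) \tag{7} $$

is

$$ H = -\frac{\hbar^2}{2m}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right) + V(x, y, z), \tag{8} $$

that is,

$$ H = -\frac{1}{\kappa}\nabla^2 + V(x, y, z), \tag{9} $$

where the operator V(x, y, z) is the same function of the operators
x, y, and z that the potential function V(x, y, z) is of the
coordinates x, y, and z, and where, as before,

$$ \kappa = \frac{2m}{\hbar^2}. \tag{10} $$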


The Schroedinger operators are differential operators; accordingly,
the Schroedinger operands are functions of x, y, and z,
or of whatever other coordinates may be employed. These
operands, denoted by ψ(x, y, z, t), by ψ(r, θ, φ, t), or simply by ψ,
are required to depend on the time t so as to satisfy the second
Schroedinger equation 1^®; but for the present we shall disregard
their time dependence and shall thus restrict ourselves to
instantaneous ψ’s.

When the Schroedinger operator associated with a dynamical
variable α has been identified, then the computation of the
possible values of this dynamical variable reduces, according to
the operator assumption I of §13, to the computation of the
eigenvalues of the operator α, that is, to the computation of the
values of the number λ for which the eigenvalue equation

$$ \alpha\psi(x, y, z, t) = \lambda\psi(x, y, z, t) \tag{11} $$

or simply

$$ \alpha\psi = \lambda\psi \tag{12} $$

has well-behaved solutions. In particular, the computation of
the energy levels of a system whose Hamiltonian is H reduces to
the calculation of the values of the number E for which the
following equation has well-behaved solutions:

$$ H\psi = E\psi, \tag{13} $$

that is,

$$ \left[-\frac{1}{\kappa}\nabla^2 + V(x, y, z)\right]\psi = E\psi, \tag{14} $$

that is,

$$ \nabla^2\psi + \kappa[E - V(x, y, z)]\psi = 0. \tag{15} $$

Equation (13), in its forms (14) or (15), is called the
(three-dimensional) first Schroedinger equation, the Schroedinger
amplitude equation, the Schroedinger wave equation, or simply the
Schroedinger equation. Note that (14) and (15) must be modified
whenever a magnetic field is involved.

We shall take it for granted that the equation of motion

$$ \dot{\alpha} = \frac{i}{\hbar}(H\alpha - \alpha H) \tag{16} $$

holds in the three-dimensional case; this equation tells us how to
construct the operator associated with the dynamical variable $\dot{\alpha}$
from the operators associated with α itself and with H. If α
commutes with H, then the operator $\dot{\alpha}$ is zero, and α is called a
constant of motion of the system. Just as in the one-dimensional
case, if α is a constant of motion, then the time derivatives of the
expected averages of α and of all of its powers are zero for every
state of the system, and consequently the distribution-in-α is
stationary for every state of the system. Again, just as in the
one-dimensional case, a dynamical variable that is a constant of
motion of a classical system is also a constant of motion of a
corresponding quantum-mechanical system.⁹

We shall also take it for granted that the expansion theorem of 
§46 holds in the three-dimensional case. 

The remainder of this section will be devoted to illustrations 
of the use of the eigenvalue equation in the Schroedinger method. 

Linear momentum. In the case of the x-component of the
linear momentum, the eigenvalue equation (12) can be written as

$$ p_x\psi = p_x'\psi, \tag{17} $$

⁹ Exceptions to this assertion arise, however, in the relativistic theory
discussed in Chapter XIV.




where $p_x$ is the operator associated with the dynamical variable
$p_x$ and where the symbol $p_x'$ replaces the symbol λ; in the
Schroedinger scheme we have, explicitly,

$$ -i\hbar\frac{\partial \psi}{\partial x} = p_x'\psi. \tag{18} $$

The solution of (18),

$$ \psi = A(y, z)\, e^{i p_x' x/\hbar} \tag{19} $$

(where A(y, z) is an arbitrary well-behaved function of y and z),
is well-behaved for every real value of $p_x'$, and hence the
eigenvalue spectrum of the operator $p_x$ extends continuously from −∞
to ∞. Accordingly, just as in classical mechanics, any real
value of $p_x$ is a possible one, so that $p_x$ is not quantized.

In the case of the dynamical variables $p_y$ and $p_z$, the
computations and the results are of course similar to those given above:
$p_y$ and $p_z$ are not quantized. The eigenfunction of the operator
$p_y$ belonging to its eigenvalue $p_y'$ turns out to be

$$ \psi = B(z, x)\, e^{i p_y' y/\hbar}, \tag{20} $$

where B(z, x) is an arbitrary well-behaved function of z and x;
the eigenfunction of the operator $p_z$ belonging to its eigenvalue
$p_z'$ is

$$ \psi = C(x, y)\, e^{i p_z' z/\hbar}, \tag{21} $$

where C(x, y) is an arbitrary well-behaved function of x and y.

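The free particle; Cartesian quantization.¹⁰ Taking the
potential of a free particle to be V(x, y, z) = 0, we find that the
Schroedinger Hamiltonian operator (8) is

$$ H = -\frac{1}{\kappa}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right). \tag{22} $$

Our problem is to find the eigenvalues of this H, that is, the values
of the number E for which the Schroedinger equation $H\psi = E\psi$
or, explicitly,

$$ -\frac{1}{\kappa}\left(\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} + \frac{\partial^2 \psi}{\partial z^2}\right) = E\psi \tag{23} $$

has well-behaved solutions.


¹⁰ The term Cartesian quantization means that the computations are
carried out in Cartesian coordinates; incidentally, we shall find that the
energy of a free particle is in fact not quantized.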





Our procedure will rely on the expansion theorem. As shown in
Exercise $8^{75}$, the dynamical variables

$$ p_x, \quad p_y, \quad p_z \tag{24} $$

are constants of motion of a free particle in the classical theory.
Each of the operators

$$ p_x = -i\hbar\frac{\partial}{\partial x}, \qquad p_y = -i\hbar\frac{\partial}{\partial y}, \qquad p_z = -i\hbar\frac{\partial}{\partial z} \tag{25} $$

commutes with our operator H, so that, in quantum mechanics
also, the set (24) is a set of constants of motion of a free particle.
Further, the operators (25) commute with one another, and
hence (24) is a set of commuting constants of motion.
Consequently, in looking for the eigenvalues of our H, we may, according
to the expansion theorem, restrict our attention to eigenfunctions
of H that are simultaneously eigenfunctions of the three p’s.

The respective eigenfunctions of $p_x$, $p_y$, and $p_z$ are (19), (20),
and (21), so that a simultaneous eigenfunction of the three p’s
has the form $\psi = A\, e^{i p_x' x/\hbar}\, e^{i p_y' y/\hbar}\, e^{i p_z' z/\hbar}$, that is,

$$ \psi = A\, e^{i(p_x' x + p_y' y + p_z' z)/\hbar}, \tag{26} $$

where A is an arbitrary constant (independent of either x, y, or z)
and where $p_x'$, $p_y'$, and $p_z'$ are arbitrary real constants. It thus
remains to adjust the constants in (26) so as to make it an
eigenfunction of our H belonging to the eigenvalue E. We have

$$ H\psi = \frac{1}{2m}\left(p_x'^2 + p_y'^2 + p_z'^2\right)\psi, \tag{27} $$

and consequently the proper adjustment is to choose the p′’s in
such a way that

$$ \frac{1}{2m}\left(p_x'^2 + p_y'^2 + p_z'^2\right) = E. \tag{28} $$

Equation (28) is to be expected by classical analogy.

Now, the p′’s being real, Equation (28) cannot be satisfied
if E is negative; but, if E ≥ 0, then a suitable choice of the p′’s
can always be made. It follows that any positive E, zero
included, is an eigenvalue of our H, that is, that any positive energy
value, zero included, is a possible energy value of a free particle
(when the potential V(x, y, z) is taken to be zero).

We note that, if E = 0, then (28) can be satisfied in only one
way, namely, by letting $p_x' = p_y' = p_z' = 0$, in which case (26)
reduces to a numerical constant; but, if E > 0, then the p′’s can
be chosen in an infinite variety of ways, different choices of the
p′’s yielding linearly independent ψ’s. Consequently, the
eigenvalue 0 of our H is nondegenerate, while every other eigenvalue
of H is degenerate to an infinite degree.

According to the expansion theorem, any eigenfunction of H
belonging to the eigenvalue E is a superposition of the solutions
(26) whose p′’s satisfy (28); and, since (except when E = 0)
the p′’s can vary continuously within certain limits, the
superposition may be either discrete or continuous.

Separation of variables. The solutions of (23), found above with the
help of the expansion theorem, can be found also by separating the
variables in (23). Restricting our attention to ψ’s of the form

$$ \psi = XYZ \tag{29} $$

[that is, ψ = X(x)Y(y)Z(z)], substituting (29) into (23), dividing through by
−XYZ/κ, and using primes in the conventional way (compare 10*®), we get

$$ \frac{X''}{X} + \frac{Y''}{Y} + \frac{Z''}{Z} = -\kappa E. \tag{30} $$

Equation (30) has the form f(x) + g(y) + h(z) = constant, and, since x, y,
and z are independent variables, it follows that the terms X″/X, Y″/Y,
and Z″/Z are each a constant. Thus we may write

$$ X''/X = -a^2, \qquad Y''/Y = -b^2, \qquad Z''/Z = -c^2, \tag{31} $$

where, in accordance with (30), the constants a, b, and c must satisfy the
equation

$$ a^2 + b^2 + c^2 = \kappa E. \tag{32} $$

The respective general solutions of the ordinary differential equations
(31) are

$$ X = A_1 e^{iax} + A_2 e^{-iax}, \qquad Y = B_1 e^{iby} + B_2 e^{-iby}, \qquad Z = C_1 e^{icz} + C_2 e^{-icz}. \tag{33} $$

Substituting these solutions into (29), we get a sum of eight terms; and,
since (32) allows a, b, and c to have either algebraic sign, each term has
the form

$$ \psi = A\, e^{i(ax + by + cz)}. \tag{34} $$

To obtain good behavior, we must now restrict a, b, and c to be real.
Writing $a = p_x'/\hbar$, $b = p_y'/\hbar$, and $c = p_z'/\hbar$, we may convert (34) into (26)
and (32) into (28).

It should perhaps be emphasized that the present procedure is no more 
rigorous from the mathematical standpoint than that which uses the 
expansion theorem : to complete the argument in either case, it is necessary 
to show that every well-behaved solution of (23) can be expanded as a 
linear combination of the special solutions (26). Our assumption that 
the expansion theorem is correct amounts in the present case just to the 
assumption that such an expansion is possible. 


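The isotropic oscillator; Cartesian quantization. In the case
of this oscillator, defined in Exercise $9^{75}$, the potential is
$\tfrac{1}{2}k(x^2 + y^2 + z^2)$, so that the Schroedinger Hamiltonian operator is

$$ H = -\frac{1}{\kappa}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right) + \frac{1}{2}k(x^2 + y^2 + z^2), \tag{35} $$

and the Schroedinger equation, Hψ = Eψ, is

$$ -\frac{1}{\kappa}\left(\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} + \frac{\partial^2 \psi}{\partial z^2}\right) + \frac{1}{2}k(x^2 + y^2 + z^2)\psi = E\psi. \tag{36} $$

We shall again rely on the expansion theorem. The quantities

$$ H^x = \frac{p_x^2}{2m} + \frac{1}{2}kx^2, \qquad H^y = \frac{p_y^2}{2m} + \frac{1}{2}ky^2, \qquad H^z = \frac{p_z^2}{2m} + \frac{1}{2}kz^2 \tag{37} $$

are, as shown in Exercise $9^{75}$, constants of motion in the classical
case, so that they are also constants of motion in the quantum
theory of the oscillator. Further, the corresponding operators

$$ H^x = -\frac{1}{\kappa}\frac{\partial^2}{\partial x^2} + \frac{1}{2}kx^2, \qquad H^y = -\frac{1}{\kappa}\frac{\partial^2}{\partial y^2} + \frac{1}{2}ky^2, \qquad H^z = -\frac{1}{\kappa}\frac{\partial^2}{\partial z^2} + \frac{1}{2}kz^2 \tag{38} $$

commute with one another, and consequently the set

$$ H, \quad H^x, \quad H^y, \quad H^z \tag{39} $$

is a set of commuting constants of motion. In looking for the
eigenvalues of H, we may, therefore, according to the expansion
theorem, restrict our attention to simultaneous eigenfunctions of
all four quantities (39).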





The operators $H^x$, $H^y$, and $H^z$ are all of the same form and are
all simpler than the operator H, which is their sum. We begin
with $H^x$. The operator

$$ H^x = -\frac{1}{\kappa}\frac{\partial^2}{\partial x^2} + \frac{1}{2}kx^2 \tag{40} $$

is identical with 2^®, that is, with the Hamiltonian operator of a
linear harmonic oscillator, and consequently its eigenvalues and
eigenfunctions can be identified by referring to 15^^ and to
Exercise 2^\. Replacing the symbol n by α, denoting the eigenvalues
of $H^x$ by $E^x_\alpha$ and the corresponding eigenfunctions by $\psi^x_\alpha$, and
disregarding normalization, we then get

$$ E^x_\alpha = (\alpha + \tfrac{1}{2})h\nu_c, \qquad \alpha = 0, 1, 2, \ldots \tag{41} $$

and

$$ \psi^x_\alpha = A(y, z)\, H_\alpha(x/a)\, e^{-\frac{1}{2}(x/a)^2}, \tag{42} $$

where $2\pi\nu_c = \sqrt{k/m}$, $a = \sqrt[4]{\hbar^2/km}$, and $H_\alpha$ is the αth Hermite
polynomial. As the symbol A(y, z) implies, the arbitrary factor
in (42), though independent of x, is allowed to be a (well-behaved)
function of y and z.
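The one-dimensional eigenvalue statement can be spot-checked numerically. The sketch below is an illustration under explicit assumptions: units are chosen so that ħ = m = k = 1 (hence a = 1, κ = 2, and hν_c = 1), the eigenfunction is taken in the standard form H_α(x) exp(−x²/2), the Hermite polynomials are generated from the usual recurrence, and the second derivative is replaced by a central difference. None of the function names come from the text.

```python
import math

def hermite(n, s):
    """Hermite polynomial H_n(s) via H_{j+1} = 2 s H_j - 2 j H_{j-1}, with H_0 = 1."""
    previous, current = 0.0, 1.0
    for j in range(n):
        previous, current = current, 2.0 * s * current - 2.0 * j * previous
    return current

def psi(n, x):
    """Unnormalized oscillator eigenfunction H_n(x) exp(-x^2/2), assuming a = 1."""
    return hermite(n, x) * math.exp(-0.5 * x * x)

def residual(n, x, h=1e-3):
    """Value of (H^x psi - E psi) at x, with H^x = -(1/2) d^2/dx^2 + x^2/2 and
    E = n + 1/2 in the assumed units; it should vanish up to discretization error."""
    second = (psi(n, x + h) - 2.0 * psi(n, x) + psi(n, x - h)) / (h * h)
    return -0.5 * second + 0.5 * x * x * psi(n, x) - (n + 0.5) * psi(n, x)

for n in range(5):
    print(n, [residual(n, x) for x in (0.0, 0.7, -1.3)])
```

The printed residuals are small compared with the size of ψ itself; they do not vanish exactly because of the finite-difference approximation.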

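The operator $H^y$ differs from $H^x$ only through the replacement
of x by y; consequently its eigenvalues and eigenfunctions are

$$ E^y_\beta = (\beta + \tfrac{1}{2})h\nu_c, \qquad \beta = 0, 1, 2, \ldots \tag{43} $$

and

$$ \psi^y_\beta = B(z, x)\, H_\beta(y/a)\, e^{-\frac{1}{2}(y/a)^2}. \tag{44} $$

Similarly the eigenvalues and eigenfunctions of $H^z$ are

$$ E^z_\gamma = (\gamma + \tfrac{1}{2})h\nu_c, \qquad \gamma = 0, 1, 2, \ldots \tag{45} $$

and

$$ \psi^z_\gamma = C(x, y)\, H_\gamma(z/a)\, e^{-\frac{1}{2}(z/a)^2}. \tag{46} $$

It follows from (42), (44), and (46) that the simultaneous
eigenfunctions of $H^x$, $H^y$, and $H^z$, which we denote by $\psi_{\alpha\beta\gamma}$, are

$$ \psi_{\alpha\beta\gamma} = A\, H_\alpha(x/a)\, e^{-\frac{1}{2}(x/a)^2}\, H_\beta(y/a)\, e^{-\frac{1}{2}(y/a)^2}\, H_\gamma(z/a)\, e^{-\frac{1}{2}(z/a)^2}, \tag{47} $$

that is,

$$ \psi_{\alpha\beta\gamma} = A\, H_\alpha(x/a)\, H_\beta(y/a)\, H_\gamma(z/a)\, e^{-\frac{1}{2}(r/a)^2}, \tag{48} $$

where A is independent of x, y, and z.

Now,

$$ H\psi_{\alpha\beta\gamma} = (H^x + H^y + H^z)\psi_{\alpha\beta\gamma} = E^x_\alpha\psi_{\alpha\beta\gamma} + E^y_\beta\psi_{\alpha\beta\gamma} + E^z_\gamma\psi_{\alpha\beta\gamma} = (E^x_\alpha + E^y_\beta + E^z_\gamma)\psi_{\alpha\beta\gamma}, \tag{49} $$

so that $\psi_{\alpha\beta\gamma}$ is an eigenfunction of H belonging to the eigenvalue

$$ E_{\alpha\beta\gamma} = E^x_\alpha + E^y_\beta + E^z_\gamma = (\alpha + \beta + \gamma + \tfrac{3}{2})h\nu_c \tag{50} $$

of H. Recalling the expansion theorem and the fact that the
quantum numbers α, β, and γ may each have any value from the
list 0, 1, 2, . . . , we conclude that the energy levels of the isotropic
spatial harmonic oscillator are

$$ \tfrac{3}{2}h\nu_c, \quad \tfrac{5}{2}h\nu_c, \quad \tfrac{7}{2}h\nu_c, \quad \ldots. \tag{51} $$

Equation (50) involves α, β, and γ only in the combination
α + β + γ, so that we may write it as

$$ E_n = (n + \tfrac{1}{2})h\nu_c, \qquad n = 1, 2, 3, \ldots, \tag{52} $$

where

$$ n = \alpha + \beta + \gamma + 1. \tag{53} $$

The integer n (whose range is to be contrasted with that of the
n in 15^®) is called the principal quantum number of the oscillator.
Except in the case n = 1 (when the oscillator is in its normal state,
that is, its state of lowest energy), several distinct choices of α,
β, and γ will yield the same value of n; the ψ-functions (47)
corresponding to these distinct choices belong to the same eigenvalue
of H but are linearly independent, and consequently all excited
energy states of the isotropic oscillator are degenerate.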
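The degeneracy can be made concrete by listing, for a given principal quantum number n, every triple of nonnegative integers (α, β, γ) with α + β + γ + 1 = n. The short sketch below does this by direct enumeration and compares the count with n(n + 1)/2; the function name is hypothetical.

```python
def oscillator_states(n):
    """All triples (alpha, beta, gamma) of nonnegative integers with
    alpha + beta + gamma + 1 == n."""
    total = n - 1
    return [(a, b, total - a - b)
            for a in range(total + 1)
            for b in range(total - a + 1)]

for n in range(1, 6):
    states = oscillator_states(n)
    # columns: n, number of triples found, n(n + 1)/2
    print(n, len(states), n * (n + 1) // 2)
```

For n = 2, for example, the enumeration returns the three triples (0, 0, 1), (0, 1, 0), and (1, 0, 0), so that the first excited level is threefold degenerate.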

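Components of orbital angular momentum. The dynamical
variables

$$ L_x = yp_z - zp_y, \qquad L_y = zp_x - xp_z, \qquad L_z = xp_y - yp_x \tag{54} $$

do not involve products of variables associated with noncommuting
operators, and consequently the corresponding Schroedinger
operators can be identified at once by a reference to (5). In
fact, we have

$$ L_x = i\hbar\left(z\frac{\partial}{\partial y} - y\frac{\partial}{\partial z}\right) = i\hbar\left(\sin\phi\frac{\partial}{\partial \theta} + \cot\theta\cos\phi\frac{\partial}{\partial \phi}\right) \tag{55} $$

$$ L_y = i\hbar\left(x\frac{\partial}{\partial z} - z\frac{\partial}{\partial x}\right) = i\hbar\left(-\cos\phi\frac{\partial}{\partial \theta} + \cot\theta\sin\phi\frac{\partial}{\partial \phi}\right) \tag{56} $$

$$ L_z = i\hbar\left(y\frac{\partial}{\partial x} - x\frac{\partial}{\partial y}\right) = -i\hbar\frac{\partial}{\partial \phi}, \tag{57} $$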

where the polar forms are taken from Exercise 




Of the operators (55) to (57), the polar form of $L_z$, that is,

$$ L_z = -i\hbar\frac{\partial}{\partial \phi}, \tag{58} $$

is the simplest, and hence we shall restrict our attention to it;
it is perhaps obvious that investigating $L_z$ is equivalent to
investigating the component of orbital angular momentum in any
prescribed direction, because the coordinate frame can always be so
chosen that its polar axis points in this direction.

To find the possible values of $L_z$, we must compute the
eigenvalues of the operator (58), that is, the values of the number λ
for which the equation

$$ L_z\psi = \lambda\psi \tag{59} $$

or, explicitly,

$$ -i\hbar\frac{\partial \psi}{\partial \phi} = \lambda\psi \tag{60} $$

has well-behaved solutions. Now, the operator $L_z$ is ħ times the
operator $-i\,\partial/\partial\phi$ whose eigenvalues are the integers 29^^, and
consequently the eigenvalues of $L_z$ are ħ times these integers.
Hence, the possible values of $L_z$ (or, for that matter, of the component
of L in any direction) are

$$ \ldots, \; -2\hbar, \; -\hbar, \; 0, \; \hbar, \; 2\hbar, \; \ldots. \tag{61} $$

Abbreviating the words “the possible values of the dynamical
variable α are” by the symbol “α ≐”, we write this result in the
form

$$ L_z \doteq m_l\hbar, \qquad m_l = 0, \pm 1, \pm 2, \ldots. \tag{62} $$
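The step from the eigenvalue equation $-i\hbar\,\partial\psi/\partial\phi = \lambda\psi$ to the integer multiples of ħ can be sketched as follows. The sketch assumes that good behavior includes single-valuedness, that is, that ψ must return to its original value when φ increases by 2π; the text itself obtains the result by citing the earlier treatment of the operator $-i\,\partial/\partial\phi$.

```latex
\begin{align*}
-i\hbar \frac{\partial \psi}{\partial \phi} = \lambda \psi
  \quad &\Longrightarrow \quad
  \psi = A(r, \theta)\, e^{i\lambda\phi/\hbar}, \\
\psi(r, \theta, \phi + 2\pi) = \psi(r, \theta, \phi)
  \quad &\Longrightarrow \quad
  e^{2\pi i \lambda/\hbar} = 1
  \quad \Longrightarrow \quad
  \lambda = m_l \hbar, \qquad m_l = 0, \pm 1, \pm 2, \ldots
\end{align*}
```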





The integer $m_l$ in (62) is called the axial, magnetic, or equatorial
quantum number. The subscript l in $m_l$ serves to remind us that
we are dealing with a component of the orbital angular momentum
L, to be distinguished from the spin angular momentum that we
shall take up later. The quantity ħ is called the Bohr unit of
angular momentum. The formula (62) is common to both the
quantum theory and the Bohr theory.

It follows from 30^^ that the eigenfunctions of $L_z$ belonging to
the eigenvalue $m_l\hbar$ of $L_z$ [we denote them by $\psi(m_l)$] have the form

$$ \psi(m_l) = A(r, \theta)\, e^{i m_l \phi}, \tag{63} $$

where A(r, θ) is an arbitrary well-behaved function of r and θ.
The eigenfunctions of $L_z$ form a set that is orthogonal in φ.

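Magnitude of orbital angular momentum. According to 4^®
and (55) to (57) the Schroedinger operator $L^2$ associated with
the square of the magnitude of the orbital angular momentum is

$$ L^2 = -\hbar^2\left[\left(z\frac{\partial}{\partial y} - y\frac{\partial}{\partial z}\right)^2 + \left(x\frac{\partial}{\partial z} - z\frac{\partial}{\partial x}\right)^2 + \left(y\frac{\partial}{\partial x} - x\frac{\partial}{\partial y}\right)^2\right], \tag{64} $$

that is, according to 74^^,

$$ L^2 = -\hbar^2\left(\frac{1}{\sin\theta}\frac{\partial}{\partial \theta}\sin\theta\frac{\partial}{\partial \theta} + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial \phi^2}\right). \tag{65} $$

The operator $L^2$ is thus $\hbar^2$ times the Legendre operator 31^^.

The possible values of $L^2$ are the eigenvalues of (65), that
is, $\hbar^2$ times the eigenvalues of the Legendrian, which are listed
in 55^^; hence, the possible values of the square of the magnitude
of the orbital angular momentum are

$$ 0, \quad 2\hbar^2, \quad 6\hbar^2, \quad 12\hbar^2, \quad 20\hbar^2, \quad \ldots. \tag{66} $$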


Using the notation introduced just above (62) we summarize
this result by writing

$$ L^2 \doteq l(l + 1)\hbar^2, \qquad l = 0, 1, 2, \ldots. \tag{67} $$

The eigenfunctions of $L^2$ belonging to the eigenvalue $l(l + 1)\hbar^2$
are the 2l + 1 functions

$$ \psi(l\, m_l) = R\, Y(l\, m_l), \qquad m_l = 0, \pm 1, \ldots, \pm l \tag{68} $$

and their linear combinations; the arbitrary well-behaved
functions R of r in the set (68) are of course independent of one another.





As recorded in 14^ and 15^, each of the functions in the set (68)
is a simultaneous eigenfunction of $L^2$ and $L_z$, belonging to the
eigenvalue $l(l + 1)\hbar^2$ of $L^2$ and the eigenvalue $m_l\hbar$ of $L_z$; hence
the notation $\psi(l\, m_l)$.

The possible values of L, the magnitude of the orbital angular 
momentum, are the positive square roots of the possible values 
of 


L = + Z - 0, 1, 2, ... . (69) 

Comparison of (69) with (62) reveals a curious feature of our 
results: while the possible values of Lz are integral multiples of A, 
the possible values of L are (except for the value 0) irrational 
multiples of h. 

The integer Z in Equations (67) to (69), whose values are 
2, • • • , is called the azimuthal quantum number] it should 
not be confused with the azimuthal quantum number k of the 
Bohr theory. 
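
The results (62), (67), and (69) can be tabulated in a few lines. The following is only an illustrative sketch: the function name `orbital_values` and the choice of units with $\hbar = 1$ are assumptions made here, not part of the text.

```python
import math

def orbital_values(l_max, hbar=1.0):
    """For l = 0..l_max, list the possible values of L^2, L and L_z.

    hbar defaults to 1.0, i.e. results are in units of the Bohr unit
    (an assumption made for this illustration).
    """
    rows = []
    for l in range(l_max + 1):
        L_squared = l * (l + 1) * hbar**2                  # Eq. (67)
        L = math.sqrt(l * (l + 1)) * hbar                  # Eq. (69)
        Lz_values = [m * hbar for m in range(-l, l + 1)]   # Eqs. (62), (68)
        rows.append((l, L_squared, L, Lz_values))
    return rows

for l, L2, L, Lz in orbital_values(3):
    print(f"l={l}: L^2={L2:g}, L={L:.4f}, L_z in {Lz}")
```

For each $l$ the list of $L_z$ values has $2l + 1$ entries, and for $l > 0$ the value of $L$ is never an integral multiple of $\hbar$.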

Exercises 

1. Compute the possible energies of a free particle, taking the potential 
to be $V(x, y, z) = V_0 = \text{constant} \neq 0$. 

2. Derive (48) and (52) by separating the variables in (36). 

3. Show that, for a preassigned $n$, the integers $\alpha$, $\beta$, and $\gamma$ in (53) can 
be chosen in $\frac{1}{2}n(n + 1)$ distinct ways, and that hence the multiplicity 
(degree of degeneracy) of the $n$th level of the isotropic oscillator is 
$\frac{1}{2}n(n + 1)$. 

4. Show that to normalize (48) we must set $A = e^{i\delta}(a^3\, 2^{n-1}\pi^{3/2}\,\alpha!\,\beta!\,\gamma!)^{-1/2}$, 
where $\delta$ is real and independent of $x$, $y$, and $z$, but otherwise arbitrary. 

5. A particle moving in the potential field 

    $V(x, y, z) = \frac{1}{2}k_1x^2 + \frac{1}{2}k_2y^2 + \frac{1}{2}k_3z^2,$

where the $k$'s are constants not all equal to one another, is called an 
anisotropic spatial harmonic oscillator. Identify a set of commuting constants 
of motion of this system, a set similar to (39); find the eigenfunctions of $H$, 
both by the expansion theorem and by separating variables; show that the 
energy levels are $E_{\alpha\beta\gamma} = (\alpha + \frac{1}{2})h\nu_1 + (\beta + \frac{1}{2})h\nu_2 + (\gamma + \frac{1}{2})h\nu_3$, where 
$\alpha, \beta, \gamma = 0, 1, 2, \ldots$, and $2\pi\nu_1 = \sqrt{k_1/m}$, and so on; point out the 
circumstances under which this energy spectrum is degenerate. 


Note that this problem is identical with that of computing the number 
of ways in which $n - 1$ indistinguishable balls can be distributed among 3 
distinguishable boxes. 





6. Show that the quantum condition (1) requires the operators $L_x$, 
$L_y$, $L_z$, and $L^2$ to satisfy the following commutation rules: 

    $L_xL_y - L_yL_x = i\hbar L_z, \quad L_yL_z - L_zL_y = i\hbar L_x, \quad L_zL_x - L_xL_z = i\hbar L_y$    (70)

and 

    $L_xL^2 - L^2L_x = L_yL^2 - L^2L_y = L_zL^2 - L^2L_z = 0,$    (71)

and verify that the Schroedinger operators $L_x$, $L_y$, $L_z$, and $L^2$ satisfy 
these rules. 

7. Recall Exercise 4^^ and show that (63) is not an eigenfunction of $L_x$ 
unless $m_l = 0$ and $A(r, \theta)$ depends on $r$ only. 

8. Verify in detail that, if the force field is central and conservative, 
then the Schroedinger operators $L_x$, $L_y$, $L_z$, and $L^2$ each commute with 
the operator (8), and that consequently the classical result of Exercise 7^® 
holds in the Schroedinger theory. 

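9. Show that in the Schroedinger scheme 

    $p_x \pm ip_y = -i\hbar e^{\pm i\phi}\left(\sin\theta\frac{\partial}{\partial r} + \frac{\cos\theta}{r}\frac{\partial}{\partial\theta} \pm \frac{i}{r\sin\theta}\frac{\partial}{\partial\phi}\right),$    (72)

    $p_z = -i\hbar\left(\cos\theta\frac{\partial}{\partial r} - \frac{\sin\theta}{r}\frac{\partial}{\partial\theta}\right),$    (73)

    $L_x \pm iL_y = \hbar e^{\pm i\phi}\left(\pm\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right),$    (74)

    $L_z = -i\hbar\frac{\partial}{\partial\phi}.$    (75)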

10. If the potential $V(x, y, z)$ is periodic in $x$, in $y$, and in $z$, with 
respective periods $a$, $b$, and $c$, that is, if for every choice of $x$, $y$, and $z$ 

    $V(x, y, z) = V(x + a, y + b, z + c),$    (76)

then we call it a (spatial) potential lattice. 

Consider a particle moving in a potential lattice, recall the procedure 
of §48, construct three constants of motion similar to the r of §48, and 
show that in computing the energy spectrum of this system we may restrict 
ourselves to eigenfunctions of $H$ having the special form 

    $\psi = e^{i(k_x x + k_y y + k_z z)}\, v(x, y, z),$    (77)

where $k_x$, $k_y$, and $k_z$ are real numerical constants, and where the factor 
$v$ has the periodicity of the lattice, that is, 

    $v(x, y, z) = v(x + a, y + b, z + c).$    (78)

This result is the three-dimensional form of Bloch's theorem 
referred to in §48. 





11. Show by an example that the sum of the possible values of two 
(or more) dynamical variables is in general not a possible value of the 
sum of these variables; then consider the set (39) and explain why the 
eigenvalues of $H$ (which equals $H_x + H_y + H_z$) can nevertheless be 
computed by merely adding the respective eigenvalues of $H_x$, $H_y$, and $H_z$. 
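
The count asserted in Exercise 3 can be checked by brute force. The sketch below assumes, following the remark about balls and boxes, that the $n$th level corresponds to $\alpha + \beta + \gamma = n - 1$; the function name `degeneracy` is introduced here only for illustration.

```python
from itertools import product

def degeneracy(n):
    """Count triples (alpha, beta, gamma) of non-negative integers
    with alpha + beta + gamma = n - 1 (assumed labeling of the n-th level)."""
    total = n - 1
    return sum(
        1
        for a, b, c in product(range(total + 1), repeat=3)
        if a + b + c == total
    )

# Compare the brute-force count with n(n + 1)/2 for the first few levels.
for n in range(1, 7):
    print(n, degeneracy(n), n * (n + 1) // 2)
```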

77. Elements of the Schroedinger Method (Continued) 

Quantum numbers. The general solution of a Schroedinger 
eigenvalue equation $\alpha\psi = \lambda\psi$ involves the parameter $\lambda$; the 
requirement that $\psi$ be well-behaved is imposed by ruling out the 
values of $\lambda$ for which $\psi$ is ill-behaved; and the remaining values of 
$\lambda$ are the eigenvalues of $\alpha$. The general expression for the 
eigenfunctions of $\alpha$ thus contains a parameter, called a quantum number, 
whose values are usually restricted. For example, the 
eigenfunctions 19 of the operator $p_x$ contain the quantum number $p_x'$, 
restricted to be real; the eigenfunctions $42^{76}$ of the operator $H_x$ 
contain the quantum number $\alpha$, restricted to the values 0, 1, 2, ...; 
the eigenfunctions $63^{76}$ of the operator $L_z$ contain the quantum 
number $m_l$, restricted to the values 0, ±1, ±2, ...; and so on. 
Accordingly, the general expression for the simultaneous 
eigenfunctions of several operators, which are not simply functions of 
one another, contains several quantum numbers; for example, the 
simultaneous eigenfunctions $68^{76}$ of $L_z$ and $L$ contain the two 
quantum numbers $m_l$ and $l$. In the three-dimensional case, the 
general expression for the simultaneous eigenfunction of a 
sufficiently large set of commuting constants of motion usually 
contains three quantum numbers; for example, $26^{76}$ contains $p_x'$, $p_y'$, 
and $p_z'$, while $48^{76}$ contains $\alpha$, $\beta$, and $\gamma$. 

The eigenfunctions of a Schroedinger operator having a discrete 
eigenvalue spectrum form an orthogonal set, just as in the 
one-dimensional case; for example, the eigenfunctions $63^{76}$ of $L_z$ form 
a set orthogonal in $\phi$, the eigenfunctions $68^{76}$ form a set orthogonal 
in the angles, the eigenfunctions $42^{76}$ of $H_x$ form a set orthogonal 
in $x$, and so on. In particular, the simultaneous eigenfunctions of 
a sufficiently large set of commuting constants of motion, 
including $H$, form a set orthogonal in the infinite region whenever the 
spectrum of each of these constants of motion is discrete; the set 
$47^{76}$ of the simultaneous eigenfunctions of $H_x$, $H_y$, and $H_z$ 


For instance, consider the kinetic energy, the potential energy, and the 
total energy of a linear harmonic oscillator. 





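is an example; denoting the three quantum numbers appearing in 
such eigenfunctions by $\lambda$, $\mu$, and $\nu$, we may then write 

    $\int \bar\psi_{\lambda\mu\nu}\,\psi_{\lambda'\mu'\nu'}\, d\tau = 0$ unless $\lambda = \lambda'$, $\mu = \mu'$, $\nu = \nu'$.    (1)

For the present, we can illustrate (1) only by a reference to $47^{76}$, 
in which case $\lambda = \alpha$, $\mu = \beta$, and $\nu = \gamma$. 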

With the help of suitable conventions, the triply subscripted 
symbols $\psi_{\lambda\mu\nu}$ can be replaced by singly subscripted $\psi$'s (thus in 
$47^{76}$ we might denote $\psi_{000}$ by $\psi_0$, $\psi_{100}$ by $\psi_1$, $\psi_{010}$ by $\psi_2$, $\psi_{001}$ by 
$\psi_3$, $\psi_{200}$ by $\psi_4$, $\psi_{110}$ by $\psi_5$, and so on); but the three-subscript 
notation is usually more convenient because it is quite explicit. 
When this notation is used, some of our familiar formulas have 
to be somewhat modified in appearance; for example, if a function 
$\psi$ admits of an expansion in terms of the triply subscripted $\psi$'s, 
we write 

    $\psi = \sum_\lambda \sum_\mu \sum_\nu c_{\lambda\mu\nu}\,\psi_{\lambda\mu\nu},$    (2)

where $\lambda$, $\mu$, and $\nu$ run through their respective mutually consistent 
values, and find, using (1), that 

    $c_{\lambda\mu\nu} = \frac{\int \bar\psi_{\lambda\mu\nu}\,\psi\, d\tau}{\int \bar\psi_{\lambda\mu\nu}\,\psi_{\lambda\mu\nu}\, d\tau}.$    (3)

Whenever one or more of the subscripts in $\psi_{\lambda\mu\nu}$ has a continuous 
range of values (that is, whenever the spectrum of one or more of 
the constants of motion under consideration is continuous), the 
corresponding summation or summations in (2) must be replaced 
by integration, and the formula (3) must be adjusted accordingly. 

The time dependence of the Schroedinger operands that pertain 
to a given dynamical system is governed, as in the one-dimensional 
case, by the second Schroedinger equation 

    $H\psi = i\hbar\frac{\partial\psi}{\partial t},$    (4)

where $H$ is the Schroedinger operator associated with the 
Hamiltonian of this system. In conservative three-dimensional motion 
of a single particle in the absence of magnetic fields, this operator 
is $8^{76}$, that is, $-\kappa^{-1}\nabla^2 + V(x, y, z)$. 

Arguments similar to those of §16 yield the following results 
for the three-dimensional case: 

(a) Every Schroedinger eigenfunction of $H$ belonging to the 
eigenvalue $E$ has the form 

    $\psi_E = \psi_E^0\, e^{-iEt/\hbar},$    (5)

where $\psi_E^0$ is an instantaneous eigenfunction of $H$ belonging to the 
eigenvalue $E$. Note our standard time factor $e^{-iEt/\hbar}$. 

(b) If an instantaneous Schroedinger function $\psi^0$ referring to 
the instant $t = 0$ is not an eigenfunction of $H$, then the 
corresponding time-dependent Schroedinger function is 

    $\psi = \sum_\lambda \sum_\mu \sum_\nu c_{\lambda\mu\nu}\,\psi_{\lambda\mu\nu},$    (6)

where the $\psi_{\lambda\mu\nu}$'s are time-dependent eigenfunctions of $H$ and the 
$c$'s are the same as those in the expansion 

    $\psi^0 = \sum_\lambda \sum_\mu \sum_\nu c_{\lambda\mu\nu}\,\psi^0_{\lambda\mu\nu},$    (7)

in which the $\psi^0_{\lambda\mu\nu}$'s are the $\psi_{\lambda\mu\nu}$'s of Equation (6) with $t$ set equal 
to zero. Hence, to compute $\psi$ when we are given $\psi^0$, we set up 
the expansion (7) and then multiply each $\psi^0_{\lambda\mu\nu}$ in (7) by the 
appropriate standard time factor. If all but one $c$ in (7) vanish, the 
result is of course just (5). 

Whenever continuous spectra are involved, the appropriate 
summations in (6) and (7) are to be replaced by integrations. 
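
The recipe in (b) amounts to attaching a time factor to each expansion coefficient. The sketch below illustrates this for a hypothetical three-term expansion; the coefficients, the energies, the units with $\hbar = 1$, and the sign convention $e^{-iEt/\hbar}$ for the standard time factor are all assumptions made for the illustration.

```python
import cmath

HBAR = 1.0  # assumed units with hbar = 1

def evolve(coeffs, energies, t):
    """Multiply each expansion coefficient by its time factor exp(-i E t / hbar)."""
    return [c * cmath.exp(-1j * E * t / HBAR) for c, E in zip(coeffs, energies)]

# Hypothetical expansion at t = 0: invented coefficients and energies.
c0 = [0.6, 0.8j, 0.0]
energies = [1.5, 2.5, 2.5]

ct = evolve(c0, energies, t=0.7)

# The squared moduli, and hence the distribution-in-E, do not change with t.
print([abs(c) ** 2 for c in c0])
print([abs(c) ** 2 for c in ct])
```

Only the phases of the coefficients change, which is why the distribution-in-E of a conservative system is stationary.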

Specification of states. In the case of a three-dimensional 
motion of a single particle, the 'configuration space' is the ordinary 
three-dimensional space, so that the assumption V® of §17 takes 
the following form: 

Vf. When a system consisting of a single particle in three 
dimensions is in the state specified by the Schroedinger function $\psi$, 
then the expected average, $\mathrm{av}_\psi\,\alpha$, of any dynamical variable $\alpha$ is 
given by the formula 

    $\mathrm{av}_\psi\,\alpha = \frac{\int \bar\psi\,\alpha\,\psi\, d\tau}{\int \bar\psi\,\psi\, d\tau},$    (8)

where $\alpha$ in the integrand is the Schroedinger operator associated 
with the dynamical variable $\alpha$, and where, as indicated, the integrals 
extend over the entire three-dimensional space. 

The operator $\alpha$ in (8) operates only on $\psi$ and not on the 
functions of the coordinates that may be contained in $d\tau$; a more 
explicit notation for the numerator of (8) is $\int \bar\psi\,(\alpha\psi)\, d\tau$. If $\psi$ is 
normalized, then (8), the (three-dimensional) Schroedinger 
expectation formula, reduces to 

    $\mathrm{av}_\psi\,\alpha = \int \bar\psi\,\alpha\,\psi\, d\tau.$    (9)

Given the Schroedinger $\psi$-function specifying a state of a 
system, the expectation formula (8) enables us, at least in 
principle, to compute (by studying the averages of the powers of $\xi$) 
the distribution function for this state of any dynamical variable $\xi$ 
pertaining to this system, and thus to find the possible values 
of $\xi$ for this state and their respective probabilities; in other 
words, given the Schroedinger $\psi$, the formula (8) enables us to 
compute everything we might want to know about the 
corresponding state of the system. Conversely, given a verbal 
specification of a state, the formula (8) enables us, at least in principle, 
to identify the Schroedinger $\psi$ that specifies this state 
mathematically: the procedure is to construct a $\psi$-function which, when 
used in (8), yields the results contained in the verbal description 
of the state. 

Arguments similar to those of §17 yield the following results 
for the three-dimensional case: 

(c) If $c$ is a numerical constant, then the Schroedinger functions 
$\psi$ and $c\psi$ specify the same state. 

(d) The distribution-in-$xyz$ (or the distribution-in-$r\theta\phi$, and so 
on, if non-Cartesian coordinates are used) for a state specified by a 
Schroedinger function $\psi$ is $\bar\psi\psi$. For this reason, Schroedinger $\psi$'s 
are called amplitudes of probability density, or simply probability 
amplitudes. 

(e) If $\psi$ is an eigenfunction of the operator $\alpha$ belonging to the 
eigenvalue $\alpha'$ of $\alpha$, then the result of a precise measurement of the 
dynamical variable $\alpha$ when the system is in the state $\psi$ is 
certainly $\alpha'$. 

If $\psi$ is not an eigenfunction of the operator $\alpha$, then the result 
of a precise measurement of the dynamical variable $\alpha$ for the 
state $\psi$ is not a certainty; the probability that this result will be a 
particular eigenvalue of $\alpha$ can be computed either by calculating 
the distribution-in-$\alpha$ for the state $\psi$, or by inspecting the 
expansion of $\psi$ in terms of the eigenfunctions of $\alpha$. For example, let 
$P(E_n)$ denote the probability that a precise energy measurement 
will yield the result $E_n$ when the state is specified by a normalized 
Schroedinger function $\psi$ and $E_n$ is a nondegenerate level; then 
$P(E_n)$ equals the square of the modulus of the 
coefficient of $\psi_n$ in the expansion of $\psi$ in terms of the normalized 
eigenfunctions of $H$. In case of degeneracy, this result is 
generalized as follows: To determine $P(E_n)$, expand the given 
(normalized) $\psi$ in terms of the normalized eigenfunctions of $H$, pick out 
in the expansion the coefficients of the eigenfunctions of $H$ belonging 
to the eigenvalue $E_n$, and add the squares of their moduli; the 
resulting sum is $P(E_n)$. Our wording of this rule implies that the 
energy spectrum is discrete; in the case of a wholly or partly 
continuous spectrum, the rule must be modified in detail. 

To illustrate: Let the state of an isotropic oscillator be represented at 
some instant by the Schroedinger function 

    $\psi = \left(1 + \frac{4x}{a} - \frac{6y}{a}\right) e^{-r^2/2a^2};$    (10)

what is the probability that the energy of the oscillator in this state is $\frac{5}{2}h\nu_c$? 

The expansion of (10) in terms of the eigenfunctions of $H$ can be 
obtained without recourse to integration, for, writing (10) as 

    $\psi = \left[e^{-r^2/2a^2}\right] + 2\left[2\frac{x}{a}\, e^{-r^2/2a^2}\right] - 3\left[2\frac{y}{a}\, e^{-r^2/2a^2}\right],$    (11)

we recognize the bracketed functions as just the functions $48^{76}$ with 
$(\alpha, \beta, \gamma)$ equal to (0, 0, 0), (1, 0, 0), and (0, 1, 0) and with $A = 1$. But the 
expansion (11) is not quite what we want; in the first place, the 
eigenfunctions of $H$ appearing in brackets in (11) are not normalized, and, 
secondly, our $\psi$ is itself not normalized. So, using the normalizing 
factors of Exercise $4^{76}$ and denoting the normalized functions $48^{76}$ by 
triply subscripted $\psi$'s, we rewrite (11) as 

    $\psi = \sqrt{a^3\pi^{3/2}}\,\psi_{[000]} + 2\sqrt{2a^3\pi^{3/2}}\,\psi_{[100]} - 3\sqrt{2a^3\pi^{3/2}}\,\psi_{[010]},$    (12)

and it remains to normalize $\psi$ itself. Multiplying the right side of (12) 
by its complex conjugate, integrating the result over all space, and 
remembering the orthonormality of the triply subscripted $\psi$'s, we find that 
$\int \bar\psi\psi\, d\tau = a^3\pi^{3/2} + 4 \cdot 2a^3\pi^{3/2} + 9 \cdot 2a^3\pi^{3/2} = 27a^3\pi^{3/2}$. Dividing (12) through 
by $(27a^3\pi^{3/2})^{1/2}$, we finally get the equation 

    $\frac{\psi}{\sqrt{27a^3\pi^{3/2}}} = \sqrt{\tfrac{1}{27}}\,\psi_{[000]} + \sqrt{\tfrac{8}{27}}\,\psi_{[100]} - \sqrt{\tfrac{18}{27}}\,\psi_{[010]}.$    (13)

The left side of (13) is a normalized Schroedinger function specifying the 
state in question; the right side is the expansion of this function in terms 
of the normalized eigenfunctions of $H$. 

The bracketed subscripts of the $\psi$'s are the numerical values of the 
quantum numbers $\alpha$, $\beta$, and $\gamma$, in this order. 

Now, we are interested in the probability that the energy of the oscillator 
is $\frac{5}{2}h\nu_c$, and consequently we pick out in (13) the eigenfunctions of $H$ 
belonging to the eigenvalue $\frac{5}{2}h\nu_c$. These eigenfunctions are $\psi_{[100]}$ and 
$\psi_{[010]}$, and their coefficients in (13) are $\sqrt{8/27}$ and $-\sqrt{18/27}$. The sum 
of the squares of the moduli of these coefficients is $\frac{26}{27}$, and hence the 
probability that a precise measurement of the energy of our oscillator will yield 
the result $\frac{5}{2}h\nu_c$ is $\frac{26}{27}$. 
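
The arithmetic of this illustration can be repeated with exact fractions. The sketch below assumes that the state is the combination (one times the 000 function, plus twice the 100 function, minus three times the 010 function, each taken with $A = 1$) and that the squared norm of such a function is $2^{\alpha+\beta+\gamma}\,\alpha!\,\beta!\,\gamma!$ in units of $a^3\pi^{3/2}$, as the normalizing factor of Exercise 4 of §76 implies; the names `norm_sq` and `level_probability` are hypothetical.

```python
from fractions import Fraction
from math import factorial

def norm_sq(alpha, beta, gamma):
    """Squared norm of an A = 1 oscillator function, in units of a^3 pi^(3/2)
    (assumed from the normalizing factor of Exercise 4 of Section 76)."""
    return (2 ** (alpha + beta + gamma)
            * factorial(alpha) * factorial(beta) * factorial(gamma))

# Coefficients of the A = 1 functions in the example state (assumed form).
raw = {(0, 0, 0): 1, (1, 0, 0): 2, (0, 1, 0): -3}

# Unnormalized weight of each term: coefficient squared times squared norm.
weights = {q: c * c * norm_sq(*q) for q, c in raw.items()}
total = sum(weights.values())

def level_probability(quanta):
    """Probability of the level with alpha + beta + gamma = quanta."""
    picked = sum(w for q, w in weights.items() if sum(q) == quanta)
    return Fraction(picked, total)

print(level_probability(0), level_probability(1))
```

The two probabilities add up to one, as they must, because the example state contains no eigenfunctions belonging to higher levels.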

The rule given above in italics holds for real dynamical variables 
other than the energy. Thus, if $P(\alpha_n)$ denotes the probability 
that a precise measurement of the dynamical variable $\alpha$ will yield 
the numerical result $\alpha_n$ when the state is specified by a normalized 
Schroedinger function $\psi$, then, to determine $P(\alpha_n)$, expand the 
given (normalized) $\psi$ in terms of the normalized eigenfunctions of 
the operator $\alpha$, pick out in the expansion the coefficients of the 
eigenfunctions of $\alpha$ belonging to the eigenvalue $\alpha_n$, and add the squares 
of their moduli; the resulting sum is $P(\alpha_n)$. Our wording of this 
rule implies that the eigenvalue spectrum of $\alpha$ is discrete; in the 
case of a wholly or partly continuous spectrum, the rule must be 
modified in detail. 

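Probability-current density. The distribution-in-$xyz$ and the 
components of probability-current density satisfy the continuity 
equation 

    $\frac{\partial S_x}{\partial x} + \frac{\partial S_y}{\partial y} + \frac{\partial S_z}{\partial z} = -\frac{\partial P}{\partial t},$    (14)

where $P$ stands for the distribution-in-$xyz$. Now, the Schroedinger 
expression for the distribution-in-$xyz$ for a state $\psi$ is $\bar\psi\psi$, and 
consequently the Schroedinger expressions for $S_x$, $S_y$, and $S_z$ for the 
state $\psi$ must satisfy the equation 

    $\frac{\partial S_x}{\partial x} + \frac{\partial S_y}{\partial y} + \frac{\partial S_z}{\partial z} = -\frac{\partial}{\partial t}(\bar\psi\psi).$    (15)

The one-dimensional formula 6^^ (which refers of course to 
probability current rather than to probability-current density) 
suggests the expressions 

    $S_x = \frac{i\hbar}{2m}\left(\psi\frac{\partial\bar\psi}{\partial x} - \bar\psi\frac{\partial\psi}{\partial x}\right), \quad S_y = \frac{i\hbar}{2m}\left(\psi\frac{\partial\bar\psi}{\partial y} - \bar\psi\frac{\partial\psi}{\partial y}\right), \quad S_z = \frac{i\hbar}{2m}\left(\psi\frac{\partial\bar\psi}{\partial z} - \bar\psi\frac{\partial\psi}{\partial z}\right).$    (16)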





The quantities (16) do indeed satisfy (15) whenever $H$ has the 
form $8^{76}$, and we adopt them as the components of 
probability-current density for the state $\psi$ for the case when $H$ is given by $8^{76}$. 
In vector notation, Equations (16) are summarized by writing 

    $\mathbf{S} = \frac{i\hbar}{2m}\left[\psi\,(\mathrm{grad}\,\bar\psi) - \bar\psi\,(\mathrm{grad}\,\psi)\right].$    (17)
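
A quick numerical check of the current density is possible along one axis. The sketch below assumes the form $S_x = (\hbar/m)\,\mathrm{Im}(\bar\psi\,\partial\psi/\partial x)$, which is algebraically the same as $(i\hbar/2m)(\psi\,\partial\bar\psi/\partial x - \bar\psi\,\partial\psi/\partial x)$; the units $\hbar = m = 1$, the wave number, and the function names are assumptions for illustration only.

```python
import cmath
import math

HBAR = 1.0  # assumed units
MASS = 1.0

def current_x(psi, x, h=1e-5):
    """S_x = (hbar/m) Im(conj(psi) dpsi/dx), derivative by central difference."""
    dpsi = (psi(x + h) - psi(x - h)) / (2 * h)
    return (HBAR / MASS) * (psi(x).conjugate() * dpsi).imag

K = 2.0  # hypothetical wave number

def plane_wave(x):
    """exp(i k x): a wave traveling toward +x."""
    return cmath.exp(1j * K * x)

def standing_wave(x):
    """cos(k x): a real function, which carries no current."""
    return math.cos(K * x)

print(current_x(plane_wave, 0.3), HBAR * K / MASS)
print(current_x(standing_wave, 0.3))
```

The traveling wave gives a current density of $\hbar k/m$, while a real $\psi$ gives none.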


Exercises 

1. The state of a free particle is specified at $t = 0$ by the function $26^{76}$. 
Use (4) to verify in detail that then the time-dependent $\psi$-function is 

    $\psi = e^{(i/\hbar)(p_x' x + p_y' y + p_z' z - Et)},$    (18)

where $E = (p_x'^2 + p_y'^2 + p_z'^2)/2m$. 

2. Show that the $x$-, $y$-, and $z$-components of the linear momentum 
of a free particle in the state (18) have certainly the respective numerical 
values $p_x'$, $p_y'$, and $p_z'$. For this reason we may say that (18) describes a 
particle moving in the direction having direction cosines $p_x'/p'$, $p_y'/p'$, 
and $p_z'/p'$ relative to the coordinate axes; here $p' = \sqrt{p_x'^2 + p_y'^2 + p_z'^2}$. 

3. Set $p_x' = p_y' = 0$ in (18) and show that, as expected by classical 
analogy, the $z$-component of the angular momentum of a free particle 
moving in the $z$-direction is certainly zero. 

4. Show that the component of L in the direction of the motion of the 
particle in the state (18) is certainly zero. 

5. The state of a free particle is specified at $t = 0$ by the $\psi$-function 

    $\psi^0 = e^{i(ax + by)} + e^{i(cx - by)},$    (19)

where $a$, $b$, and $c$ are real constants, and $a^2 \neq c^2$. Construct the 
time-dependent $\psi$. Compute the possible values of $p_x$, $p_y$, and $p_z$ for this 
state, and the respective probabilities of these values. 

6. Write down the time-dependent and normalized form of $48^{76}$; include 
the arbitrary phase, which is now independent of $x$, $y$, $z$, and $t$. 

7. Show that, if an isotropic oscillator is in its normal state, then each 
of the dynamical variables $37^{76}$ has certainly the value $\frac{1}{2}h\nu_c$, and each of 
the dynamical variables $L_x$, $L_y$, $L_z$, and $L$ has certainly the value 0. 

8. The state of an isotropic oscillator is specified at $t = 0$ by the 
instantaneous $\psi$-function (10). Construct the time-dependent $\psi$ and find 
the distribution-in-$E$ for the instant $t$. 

9. According to $48^{76}$, the following instantaneous $\psi$-functions belong to 
the (triply degenerate) first excited state of an isotropic oscillator: 

    $\psi_{[100]} = x\,e^{-r^2/2a^2}, \quad \psi_{[010]} = y\,e^{-r^2/2a^2}, \quad \psi_{[001]} = z\,e^{-r^2/2a^2}.$    (20)

(a) Show that $L_x\psi_{[100]} = 0$, $L_y\psi_{[100]} = -i\hbar\psi_{[001]}$, $L_z\psi_{[100]} = i\hbar\psi_{[010]}$, 
$L_x\psi_{[010]} = i\hbar\psi_{[001]}$, $L_y\psi_{[010]} = 0$, $L_z\psi_{[010]} = -i\hbar\psi_{[100]}$, $L_x\psi_{[001]} = -i\hbar\psi_{[010]}$, 
$L_y\psi_{[001]} = i\hbar\psi_{[100]}$, $L_z\psi_{[001]} = 0$. 

Both $8^{76}$ and (16) are to be modified when the particle is electrically 
charged and moves in a magnetic field. 

(b) Show that, for the state $\psi_{[001]}$, the energy of the oscillator is 
certainly $\frac{5}{2}h\nu_c$, $H_x$ is certainly $\frac{1}{2}h\nu_c$, $H_y$ is also certainly $\frac{1}{2}h\nu_c$, $H_z$ is certainly 
$\frac{3}{2}h\nu_c$, $L_z$ is certainly 0, $L$ is certainly $\sqrt{2}\,\hbar$, $L_x$ is uncertain and equally 
likely to be $\hbar$ or $-\hbar$, and $L_y$ is also uncertain and equally likely to be 
$\hbar$ or $-\hbar$. Make similar computations for the states $\psi_{[100]}$ and $\psi_{[010]}$. 

(c) Show that, for the state of the isotropic oscillator described by the 
Schroedinger function $\psi_{[100]} + i\psi_{[010]}$, the energy is certainly $\frac{5}{2}h\nu_c$; $H_z$ is 
certainly $\frac{1}{2}h\nu_c$; $L_z$ is certainly $\hbar$; $L$ is certainly $\sqrt{2}\,\hbar$; $L_x$ (and also $L_y$) is 
uncertain, its possible values being $\hbar$, 0, and $-\hbar$, and the respective 
probabilities of these values being $\frac{1}{4}$, $\frac{1}{2}$, and $\frac{1}{4}$; $H_x$ (and also $H_y$) is uncertain 
and equally likely to be $\frac{1}{2}h\nu_c$ or $\frac{3}{2}h\nu_c$. 

(d) Construct a linear combination of the $\psi$-functions (20) describing a 
state for which $L_z$ is certainly $-\hbar$, and consider this state along the lines 
of part (c). 

(e) Construct a linear combination of the $\psi$-functions (20) describing 
a state for which $L_y$ is certainly $\hbar$, and consider this state along the lines 
of part (c). 

(f) We are told that the energy of an isotropic oscillator is certainly 
$\frac{5}{2}h\nu_c$ and that no other information is available; write down the 
Schroedinger function that summarizes this information, and discuss the 
significance of its arbitrary features. 

(g) Show that, if the energy of an isotropic oscillator is certainly $\frac{5}{2}h\nu_c$, 
then the magnitude of its orbital angular momentum is certainly $\sqrt{2}\,\hbar$. 

10. Show that the functions (20) can be written as 

    $\psi_{[100]} = -R\,T(1\ 1) + R\,T(1\ {-1}),$    (21)

    $\psi_{[010]} = iR\,T(1\ 1) + iR\,T(1\ {-1}),$    (22)

    $\psi_{[001]} = \sqrt{2}\,R\,T(1\ 0),$    (23)

where $R = \sqrt{2\pi/3}\; r \exp(-r^2/2a^2)$. 

11. Use Equations (21) to (23) to verify by inspection the results of 
parts (b), (c), (d), and (g) of Exercise 9 concerned with $L_z$ and $L$. 

12. Describe by diagrams the distribution-in-$xyz$ of an isotropic 
oscillator in the normal state; in the state $\alpha = 1$, $\beta = 0$, $\gamma = 0$. 

13. Show that for an isotropic oscillator^* av[i>:oj *■ Ja*, avfoooir* * Jo*, 
avioooipi = iftwa*, avuooja;* » avuoojy* ia*, avuooir* » ja*. 

14. Recall Exercise $5^{76}$ and show that, if the total energy of an 
anisotropic oscillator having a nondegenerate energy spectrum is certainly 
$E_{\alpha\beta\gamma}$, then the respective averages of the potential and the kinetic energies 
are $\frac{1}{2}E_{\alpha\beta\gamma}$ each. 

15. Show that, if the total energy of an isotropic oscillator is certainly 
$E_n$, then the respective averages of the potential and the kinetic energies 
are $\frac{1}{2}E_n$ each; remember the degeneracy of the energy spectrum. 

16. Show that the quantities (16) satisfy (15) whenever $H$ is given by $8^{76}$. 

To verify this distribution-in-$L_x$, compute av $L_x^k$ for $k = 0, 1, 2$, 
and so forth, and then use the uniqueness theorem of §11. 

The subscripts of the symbol av are the values of the quantum 
numbers $\alpha$, $\beta$, and $\gamma$ for the energy state in question. 

17. Compute the distribution-in-$xyz$ and the probability current for a 
free particle in the state (18), and note that the direction of the current 
is that of the motion of the particle. 

18. The $\psi$-function (18) is called a plane monochromatic de Broglie wave. 

Show that the direction of propagation of this wave is that of the motion 
of the particle, and that the correlations and hold in the three- 

dimensional case. 

19. Derive the theorem concerning the distribution-in-$E$ stated in 
italics on page 444. 

20. Derive the Heisenberg inequalities 14^®. 

21. Use the Hamiltonian $8^{76}$ and the equation of motion 16^® to show 
that the Schroedinger operator associated with the radial momentum 
$p_r$ (that is, with $m\dot r$) is 

    $p_r = -i\hbar\left(\frac{\partial}{\partial r} + \frac{1}{r}\right),$    (24)

and verify that $p_r^2 = -\hbar^2\,\frac{1}{r^2}\frac{\partial}{\partial r}\, r^2\frac{\partial}{\partial r}$. 

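78. Central Fields 


We shall now consider a particle moving in a (conservative) 
central field, in which case the potential function has the special 
form $V = V(r)$, and consequently the Schroedinger Hamiltonian 
operator is 

    $H = -\frac{1}{\kappa}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right) + V(r),$    (1)

or, when the Cartesian derivatives are replaced by their polar 
equivalents, 

    $H = -\frac{1}{\kappa}\left(\frac{1}{r^2}\frac{\partial}{\partial r}\, r^2\frac{\partial}{\partial r} + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\sin\theta\frac{\partial}{\partial\theta} + \frac{1}{r^2\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right) + V(r).$    (2)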


Radial momentum is an example of a quantity having a classical 
analogue but not admitting of a precise quantum-mechanical 
measurement; see Kemble, footnote on page 335. Lest our Equation (24) appear 
to contradict some of the statements on page 297 of Kemble, we must 
mention that Professor Kemble transforms operators and $\psi$-functions from 
one coordinate frame to another by a method that is different from our 
elementary method. 

For brevity, we say 'central field' for 'conservative central field,' and 
'central motion' for 'motion in a conservative central field.' 

An elementary though tedious procedure for identifying the polar 
form of $\nabla^2$ is to square and add the right sides of 68'^*, 69^*, and 70^-*. 





Using the operators $L^2$ and $p_r$ given by $65^{76}$ and $24^{77}$ we can 
rewrite (2) in the condensed form 

    $H = \frac{1}{2m}\,p_r^2 + \frac{1}{2mr^2}\,L^2 + V(r),$    (3)

which is precisely the form taken in the case of a central field by 
the classical Hamiltonian 40^^. 

Constants of motion. A most important characteristic of 
classical motion in a central field is that, quite independently of 
the explicit form of $V(r)$, the orbital momentum is conserved, 
that is, $L_x$, $L_y$, $L_z$, and $L$ are constants of motion. We have 
verified in Exercise $8^{76}$ that this characteristic persists in the 
Schroedinger theory of central motion, and it follows that, 
according to (nonrelativistic) quantum mechanics, the distribution-in-$L_x$, 
the distribution-in-$L_y$, the distribution-in-$L_z$, and the 
distribution-in-$L$ are all stationary for every state of a particle moving in a 
central field. 

The operators $L_x$, $L_y$, and $L_z$ do not commute with one another 
(recall $70^{76}$), and consequently $\psi$-functions that are simultaneous 
eigenfunctions of these operators are rather an exception; in fact, 
it follows from Exercise $7^{76}$ that only spherically symmetric $\psi$'s, 
that is, $\psi$'s depending on $r$ only, are simultaneous eigenfunctions 
of $L_x$, $L_y$, and $L_z$. Hence, although the respective 
distributions in $L_x$, in $L_y$, and in $L_z$ are all stationary in central motion, 
the fact that the value of one of these variables is a certainty for 
some state usually implies that the values of the other two are 
not certainties for this state. In the exceptional case of states 
specified by spherically symmetric $\psi$'s and called s-states, the 
values of $L_x$, $L_y$, $L_z$, and $L$ are all certainly zero. 
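
The commutation rules for the components of the orbital angular momentum can be checked on a small example. The sketch below does not use the Schroedinger differential operators themselves; as a stand-in it uses the standard 3-by-3 matrices that represent $L_x$, $L_y$, and $L_z$ on the three states with $l = 1$, in units with $\hbar = 1$. The matrices and the helper names are assumptions introduced for this illustration.

```python
import math

def matmul(A, B):
    """Product of two square matrices given as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def scale(c, A):
    return [[c * a for a in row] for row in A]

def commutator(A, B):
    """AB - BA."""
    return add(matmul(A, B), scale(-1, matmul(B, A)))

def close(A, B, tol=1e-12):
    return all(abs(a - b) < tol for ra, rb in zip(A, B) for a, b in zip(ra, rb))

# l = 1 matrices in the basis m_l = 1, 0, -1 (hbar = 1, standard phase choice).
r = 1 / math.sqrt(2)
Lx = [[0, r, 0], [r, 0, r], [0, r, 0]]
Ly = [[0, -1j * r, 0], [1j * r, 0, -1j * r], [0, 1j * r, 0]]
Lz = [[1, 0, 0], [0, 0, 0], [0, 0, -1]]
L2 = add(add(matmul(Lx, Lx), matmul(Ly, Ly)), matmul(Lz, Lz))
ZERO = [[0, 0, 0], [0, 0, 0], [0, 0, 0]]

print(close(commutator(Lx, Ly), scale(1j, Lz)))  # components do not commute
print(close(commutator(Lz, L2), ZERO))           # each commutes with L^2
print(L2[0][0].real, L2[1][1].real, L2[2][2].real)
```

In this representation the square of the magnitude is twice the unit matrix, in agreement with $l(l + 1) = 2$ for $l = 1$.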

Eigenfunctions of H. When the field is central, the set 

    $H, \quad L, \quad L_z$    (4)

is a set of commuting constants of motion. The simultaneous 
eigenfunctions of $H$, $L$, and $L_z$ will be called for brevity the basic 
$\psi$'s or, more precisely, the basic polar $\psi$'s, where the word polar 
means that we use the spherical polar coordinate frame. 
According to the expansion theorem, every $\psi$-function pertaining to a 

The set $H$, $L$, and $L_x$ and the set $H$, $L$, and $L_y$ are also sets of 
commuting constants of motion; but they are not as convenient as (4) because 
the polar forms of the operators $L_x$ and $L_y$ are not as simple as the polar 
form of the operator $L_z$. 





given system with a spherically symmetric potential is either a 
basic (polar) $\psi$ of the system or a superposition of the basic 
(polar) $\psi$'s. 

Now, the simultaneous eigenfunctions of $L$ and $L_z$ are 

    ($68^{76}$)    $\psi(l\, m_l) = R\,T(l\, m_l),$    (5)

where $R$ is an arbitrary well-behaved function of $r$. To compute 
the basic $\psi$'s, it thus remains to adjust $R$ so as to make (5) an 
eigenfunction of $H$ belonging to an eigenvalue $E$ of $H$, that is, 
to adjust $R$ so as to satisfy the equation 

    $H\psi(l\, m_l) = E\psi(l\, m_l).$    (6)

Using the form (3) of $H$ and remembering that $\psi(l\, m_l)$ is an 
eigenfunction of $L^2$ belonging to the eigenvalue $l(l + 1)\hbar^2$, we find that 

    $H\psi(l\, m_l) = \frac{1}{2m}\,p_r^2\,\psi(l\, m_l) + \frac{1}{2mr^2}\,L^2\,\psi(l\, m_l) + V\psi(l\, m_l)$
    $\qquad\qquad = \frac{1}{2m}\,p_r^2\,\psi(l\, m_l) + \frac{l(l + 1)\hbar^2}{2mr^2}\,\psi(l\, m_l) + V\psi(l\, m_l).$    (7)

Equating this to the right side of (6) and canceling the common 
angle factor, we finally get the equation 

    $\frac{1}{2m}\,p_r^2 R + \frac{l(l + 1)}{\kappa r^2}\,R + V(r)R = ER,$    (8)

that is, 

    $-\frac{1}{\kappa}\,\frac{1}{r^2}\frac{d}{dr}\, r^2\frac{d}{dr}R + \frac{l(l + 1)}{\kappa r^2}\,R + V(r)R = ER,$    (9)

called the radial equation. 

We now introduce the function 

    $\chi(r) = rR$    (10)

and find with the help of (9) that it satisfies the equation 

    $-\frac{1}{\kappa}\frac{d^2\chi}{dr^2} + \left[\frac{l(l + 1)}{\kappa r^2} + V(r)\right]\chi = E\chi.$    (11)
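
The step from the radial equation (9) to the equation for $\chi = rR$ can be spelled out as follows. This is only a sketch of the intermediate algebra, with $\kappa$ denoting the same constant that appears in (9).

```latex
% Substitute R = chi / r into the radial equation (9).
\begin{align*}
R &= \frac{\chi}{r}, \qquad
r^2\frac{dR}{dr} = r\frac{d\chi}{dr} - \chi, \\
\frac{1}{r^2}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right)
  &= \frac{1}{r^2}\left(\frac{d\chi}{dr} + r\frac{d^2\chi}{dr^2} - \frac{d\chi}{dr}\right)
   = \frac{1}{r}\frac{d^2\chi}{dr^2}, \\
-\frac{1}{\kappa}\,\frac{1}{r}\frac{d^2\chi}{dr^2}
  + \frac{l(l+1)}{\kappa r^2}\,\frac{\chi}{r} + V(r)\,\frac{\chi}{r}
  &= E\,\frac{\chi}{r}
\quad\Longrightarrow\quad
-\frac{1}{\kappa}\frac{d^2\chi}{dr^2}
  + \left[\frac{l(l+1)}{\kappa r^2} + V(r)\right]\chi = E\chi .
\end{align*}
```

The first-derivative terms cancel, which is the reason for introducing $\chi$: the resulting equation contains only a second derivative, as a one-dimensional Schroedinger equation does.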





This equation has the general form of a one-dimensional 
Schroedinger equation describing the radial motion of a particle in a 
potential field $l(l + 1)/\kappa r^2 + V(r)$, and we may expect by 
analogy with the true one-dimensional case that solutions of 
the $\chi$-equation (11) which yield well-behaved $R$'s exist only for 
restricted values of $E$, and that in the process of solving (11) an 
extra quantum number, say $n$, would usually come into play. 
The well-behaved $R$'s would therefore usually involve $n$, called 
the principal quantum number, in addition to the azimuthal 
quantum number $l$ appearing in (9), and we may denote them by 
$R(n\, l)$; the basic polar $\psi$'s then take the form 

    $\psi(n\, l\, m_l) = R(n\, l)\,T(l\, m_l).$    (12)

The value of $E$ in (9) belonging to $R(n\, l)$ and denoted by $E_{nl}$ 
is independent of the axial quantum number $m_l$, since $m_l$ does 
not appear in the radial equation. It follows that the energy 
spectrum of a particle in central motion is degenerate with respect 
to the axial quantum number; in fact, a level having a specific value 
of $l$ consists of $2l + 1$ coincident levels. Indeed, to construct a 
basic $\psi$ when a specific $R(n\, l)$ is given, we must let $l$ in $T(l\, m_l)$ 
be the same as in $R(n\, l)$; but, this done, we may choose $m_l$ in 
$T(l\, m_l)$ in any one of the $2l + 1$ ways consistent with the given $l$, 
each choice yielding a distinct $\psi(n\, l\, m_l)$ belonging to the same 
eigenvalue of $H$. 

The conclusion that the energy spectrum for central motion 
must be degenerate can, of course, be drawn directly from the 
result of Exercise 9®® and the fact that $L_x$, $L_y$, and $L_z$, which are 
constants of motion for central fields, do not commute with 
each other. 

A state for which the value of $L$ is certainly 0 is called an 
s-state; one for which $L$ is certainly $\sqrt{2}\,\hbar$ is a p-state, one for 
which $L$ is certainly $\sqrt{6}\,\hbar$ is a d-state, and one for which $L$ is 
certainly $\sqrt{12}\,\hbar$ is an f-state; the symbols s, p, and d have their 
origin in the experimental classification of spectral series into 
sharp, principal, and diffuse. For states described by the basic 
functions (12) we then have the following names: 


Table 18 

    Value of l    Name of state
    0             s-state
    1             p-state
    2             d-state
    3             f-state


For states with larger $l$'s the names are assigned alphabetically, 
starting with g. 

The radial equation, which involves $V(r)$, must be studied 
separately for each specific central field, so that we cannot go 
further in classifying energy levels without turning to special cases. 

The term $l(l + 1)/\kappa r^2$ in (11), associated with the term $L^2/2mr^2$ in (3), 
represents a centrifugal effect; note that classically $L^2/2mr^2$ is that part 
of the kinetic energy which arises because of the tangential motion of the 
particle, that is, the motion at right angles to r. 

Note that the independent variable $r$ in (11) has the range $(0, \infty)$, 
rather than $(-\infty, \infty)$ as in the true one-dimensional case, and that, 
according to (10), good behavior of $R$ usually implies that $\chi$ vanishes at $r = 0$. 

Exercises 

1. Show that, if (12) is normalizable and $R$ in (12) is real, except perhaps 
for a complex multiplicative constant, then the average of $p_r$ for the state 
(12) is zero. Explain why this result is to be expected on intuitive physical 
grounds. 

2. Using the abbreviation $B(r, \theta) = (\hbar/mr\sin\theta)\,|R(n\, l)|^2\,[\Theta(l\, m_l)]^2$, 
show that, if $R$ in (12) is real, except perhaps for a complex multiplicative 
constant, then the components of the relative probability current for the 
state $\psi(n\, l\, m_l)$ are $S_x = -m_l B(r, \theta)\sin\phi$, $S_y = m_l B(r, \theta)\cos\phi$, and $S_z = 0$. 
Show that hence the lines of flow of the probability current are circles 
centered on the $z$-axis and parallel to the $xy$-plane, and that the direction 
of the flow is consistent with the fact that $\psi(n\, l\, m_l)$ specifies a state for 
which $L_z$ is certainly $m_l\hbar$. 

3. Write down the Schroedinger equation $H\psi = E\psi$, where $H$ is given 
by (2), assume that $\psi$ has the form $R(r)\Theta(\theta)\Phi(\phi)$, and derive the radial 
equation (9) by separating variables. 

79. The Free Particle; Polar Quantization 

Perhaps the simplest example of a central motion is offered by 
a free particle, in which case 

V = V(r) = 0, (1) 


The symmetry of the current about the z-axis may on first thought 
seem surprising; this point will be taken up in §80. 







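and the χ-equation 11⁷⁸ becomes

$$\left[-\frac{1}{\kappa}\frac{d^2}{dr^2} + \frac{l(l+1)}{\kappa r^2}\right]\chi = E\chi. \qquad (2)$$

When we introduce the new independent variable

$$s = \sqrt{\kappa E}\; r \qquad (3)$$

and use the subscript l with χ to help in classifying the solutions,
(2) becomes

$$\left[\frac{d^2}{ds^2} + 1 - \frac{l(l+1)}{s^2}\right]\chi_l(s) = 0. \qquad (4)$$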


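A particular solution of this equation is

$$\chi_l(s) = \sqrt{s}\; J_{l+\frac12}(s), \qquad (5)$$

where $J_{l+\frac12}$, the Bessel function of order $l + \frac12$, is²⁴

$$J_{l+\frac12}(s) = \frac{1}{\sqrt{2\pi s}}\left[\, e^{is}\sum_{k=0}^{l}\frac{i^{\,k-l-1}\,(l+k)!}{k!\,(l-k)!\,(2s)^k} + e^{-is}\sum_{k=0}^{l}\frac{(-i)^{\,k-l-1}\,(l+k)!}{k!\,(l-k)!\,(2s)^k}\right], \qquad (6)$$

so that, for example,

$$J_{\frac12}(s) = \sqrt{\frac{2}{\pi s}}\,\sin s \qquad (7)$$

$$J_{\frac32}(s) = \sqrt{\frac{2}{\pi s}}\left(\frac{\sin s}{s} - \cos s\right) \qquad (8)$$

$$J_{\frac52}(s) = \sqrt{\frac{2}{\pi s}}\left[\left(\frac{3}{s^2} - 1\right)\sin s - \frac{3}{s}\cos s\right]. \qquad (9)$$

Each $J_{l+\frac12}$ vanishes at s = 0 and, as s increases, oscillates about the
s-axis with gently decreasing amplitude; the nodes of the J's are
unequally spaced except in the case of $J_{\frac12}$.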

Equation (4), being of the second order, has solutions that are
linearly independent of the particular solution (5); but these solutions
yield ill-behaved R's and hence are of no interest to us.


24 It should perhaps be emphasized that (6) is correct only when l is
zero or a positive integer, which is so in our case.





Dividing (5) by r, multiplying the quotient for the sake of
appearances by $(\kappa E)^{-1/2}$, and denoting the result by²⁵ R(E, l), we get

$$R(E, l) = s^{-1/2} J_{l+\frac12}(s), \qquad s = \sqrt{\kappa E}\; r. \qquad (10)$$

According to 10⁷⁸ the functions (10) are, apart from arbitrary
multiplicative constants, the radial factors of the basic polar ψ's
for a free particle. In particular, apart from arbitrary
multiplicative constants,²⁶

$$R(E, 0) = \frac{\sin s}{s} \qquad (11)$$

$$R(E, 1) = \frac{\sin s}{s^2} - \frac{\cos s}{s} \qquad (12)$$

$$R(E, 2) = \left(\frac{3}{s^3} - \frac{1}{s}\right)\sin s - \frac{3\cos s}{s^2}. \qquad (13)$$

Graphs of these functions are shown in Fig. 104. 
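
As a numerical cross-check of (11), (12), and (13), the following minimal sketch tests that χ = sR (which differs from (5) only by a constant factor) satisfies Equation (4). The function names and the finite-difference step are illustrative choices, not part of the text.

```python
import math

def R(l, s):
    """Radial factors (11)-(13), i.e. s**-0.5 * J_{l+1/2}(s) without sqrt(2/pi)."""
    if l == 0:
        return math.sin(s) / s
    if l == 1:
        return math.sin(s) / s**2 - math.cos(s) / s
    if l == 2:
        return (3 / s**3 - 1 / s) * math.sin(s) - 3 * math.cos(s) / s**2
    raise ValueError("only l = 0, 1, 2 are written out here")

def chi(l, s):
    # chi = s * R agrees with (5) up to a constant factor
    return s * R(l, s)

def residual(l, s, h=1e-3):
    """Left side of (4), with chi'' approximated by a central difference."""
    d2 = (chi(l, s + h) - 2 * chi(l, s) + chi(l, s - h)) / h**2
    return d2 + (1 - l * (l + 1) / s**2) * chi(l, s)

for l in (0, 1, 2):
    worst = max(abs(residual(l, s)) for s in (0.5, 1.0, 2.0, 5.0, 10.0))
    print(l, worst)  # each value should be small (set by the step h)
```

The residuals are limited by the finite-difference step rather than by the formulas themselves.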



The functions (10) involve the parameter E through s. The
function R(E, 0) is well-behaved if E ≥ 0 and is ill-behaved otherwise;
the remaining R's are well-behaved if E > 0 and are ill-behaved
otherwise (they vanish identically for E = 0). Thus
every positive value of E in (2), zero included, yields well-behaved
R's, so that, as we have already shown in §76, every positive
energy value, zero included, is a possible energy value of a free
particle (when the potential is taken to be zero).


25 We write R(E, l) rather than R(n l) because the energy of a free
particle is not quantized.

26 Note that in going from (10) to (11), (12), and (13), we omitted the
factors $\sqrt{2/\pi}$ that appear in the J's.

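According to (10) and 12⁷⁸, the basic polar ψ's for a free particle
of energy E are,²⁷ when we include the time factors but omit the
arbitrary multiplicative constants,

$$\psi(E, l, m_l) = s^{-1/2} J_{l+\frac12}(s)\, Y(l\, m_l)\, e^{-iEt/\hbar}, \qquad (14)$$

so that we can write, omitting further numerical coefficients,

$$\psi(E, 0, 0) = \frac{\sin s}{s}\; e^{-iEt/\hbar} \qquad (15)$$

$$\psi(E, 1, -1) = \left(\frac{\sin s}{s^2} - \frac{\cos s}{s}\right)\sin\theta\; e^{-i\phi}\, e^{-iEt/\hbar} \qquad (16)$$

$$\psi(E, 1, 0) = \left(\frac{\sin s}{s^2} - \frac{\cos s}{s}\right)\cos\theta\; e^{-iEt/\hbar} \qquad (17)$$

$$\psi(E, 1, 1) = \left(\frac{\sin s}{s^2} - \frac{\cos s}{s}\right)\sin\theta\; e^{i\phi}\, e^{-iEt/\hbar} \qquad (18)$$

and so forth.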


If E in (14) is greater than zero, then every one of the functions
(14), and hence also every linear combination of these functions,
is an eigenfunction of the Hamiltonian (of the free particle)
belonging to the eigenvalue E, so that every energy level of a free
particle, except the level E = 0, is degenerate to an infinite
degree; this result was obtained once before in §76. The
degeneracy of the energy spectrum of a free particle is thus much
greater than the minimum degeneracy pointed out in italics on
page 451.

Relations between the polar and the Cartesian ψ-functions for
a free particle. Let us call for the moment the functions (14)
the (basic) polar ψ's of the free particle, and the functions

$$\psi = e^{i(p_x x + p_y y + p_z z)/\hbar}\; e^{-iEt/\hbar} \qquad (19)$$

the (basic) Cartesian ψ's of the free particle.


27 Only (15) belongs to E = 0.





The Cartesian ψ's were computed by the use of the commuting
constants of motion H, $p_x$, $p_y$, and $p_z$; and each of them represents
a state for which not only the energy but also $p_x$, $p_y$, and
$p_z$ are certainties.

The polar ψ's were computed by the use of the commuting
constants of motion H, L, and $L_z$; and each of them represents
a state for which not only the energy but also L and $L_z$ are
certainties.

H, L, and $L_z$ being a set of commuting constants of motion of a
free particle, the expansion theorem claims that every ψ-function
of a free particle is a superposition²⁸ of the polar ψ's of a free
particle. Thus the expansion theorem claims, in particular, that
each of the functions (19) is a superposition of the functions (14).
This is indeed so. To illustrate, we consider the Cartesian ψ

$$\psi = e^{i p_z z/\hbar}, \qquad (20)$$

which represents instantaneously a free particle of definite energy
moving in the z-direction, and which can be written as²⁹

$$\psi = e^{i s\cos\theta}, \qquad (21)$$

where $s = \sqrt{\kappa E}\; r$. The function (21) satisfies the relation³⁰

$$e^{i s\cos\theta} = \sum_{l=0}^{\infty}\left[\, i^{\,l}\,\pi\sqrt{2(2l+1)}\,\right] s^{-1/2} J_{l+\frac12}(s)\, Y_l^0, \qquad (22)$$

and we recognize the right side of (22) as a linear combination of
our instantaneous polar ψ's (14). The possibility of expanding
the Cartesian ψ's in terms of the polar ψ's has thus been verified
for the special case of a particle moving in the z-direction.

Similarly, the set H, $p_x$, $p_y$, and $p_z$ being a set of commuting
constants of motion of a free particle, the expansion theorem
claims that every ψ-function of a free particle is a superposition
of the Cartesian ψ's of a free particle. Thus the expansion
theorem claims, in particular, that each of the functions (14) is
a superposition of the functions (19). This claim can also be
verified rigorously.
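
The plane-wave expansion of a wave travelling along z can be checked numerically. The sketch below assumes the usual normalization $Y_l^0 = \sqrt{(2l+1)/4\pi}\,P_l(\cos\theta)$ and the identity $s^{-1/2}J_{l+\frac12}(s) = \sqrt{2/\pi}\,j_l(s)$, where $j_l$ is the spherical Bessel function; both are standard facts supplied here, and the helper names are illustrative.

```python
import cmath
import math

def spherical_jn(l, s, terms=60):
    """j_l(s) from its power series (adequate for moderate s)."""
    term = s**l
    for odd in range(1, 2 * l + 2, 2):      # divide by (2l+1)!!
        term /= odd
    total = 0.0
    for k in range(terms):
        total += term
        term *= -(s * s / 2) / ((k + 1) * (2 * l + 2 * k + 3))
    return total

def legendre(l, x):
    """P_l(x) by the three-term recurrence."""
    p_prev, p = 1.0, x
    if l == 0:
        return p_prev
    for n in range(1, l):
        p_prev, p = p, ((2 * n + 1) * x * p - n * p_prev) / (n + 1)
    return p

def bauer_sum(s, theta, lmax=30):
    """Partial sum of the expansion of exp(i s cos(theta)) in polar functions."""
    total = 0j
    for l in range(lmax + 1):
        coeff = (1j ** l) * math.pi * math.sqrt(2 * (2 * l + 1))
        radial = math.sqrt(2 / math.pi) * spherical_jn(l, s)   # s**-0.5 * J_{l+1/2}(s)
        y_l0 = math.sqrt((2 * l + 1) / (4 * math.pi)) * legendre(l, math.cos(theta))
        total += coeff * radial * y_l0
    return total

s, theta = 3.0, 0.7
print(abs(bauer_sum(s, theta) - cmath.exp(1j * s * math.cos(theta))))
```

For moderate s the series converges quickly, because $j_l(s)$ becomes negligible once l is well above s.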

28 A superposition may of course involve only one term.

29 Note that z = r cos θ and that, when $p_x = p_y = 0$, then, according to
28^®, $p_z = \sqrt{2mE}$ and $p_z/\hbar = \sqrt{\kappa E}$.

30 See Bauer's formula in a book on Bessel functions.





Exercises 

1. The state of a free particle is specified at t = 0 by the ψ-function
$r^{-1}\sin\alpha r\,\cos\beta r$, where α and β are nonvanishing real constants having
the physical dimensions cm⁻¹. Show that the energy of the particle is
not a certainty, and that its possible values are $(\alpha+\beta)^2/\kappa$ and $(\alpha-\beta)^2/\kappa$;
and construct the time-dependent ψ.

2. Verify the result of Exercise 3^^ by inspecting (22). 

3. Use polar quantization to compute the possible energies of a free
particle when $V(r) = V_0$ = constant ≠ 0.


80. The Bohr Formula for the Hydrogen Atom; Fixed Nucleus

We consider next an electron (mass m and charge³¹ −e) which,
when at a distance r from the origin O, is urged toward O with
the inverse-square or Coulomb force of magnitude $e^2/r^2$, as though a
nucleus of charge e were permanently located at O. This system
we call, rather loosely, a hydrogen atom. For the present we
neglect relativity effects and effects due to the so-called electron
spin.

Our potential function is now³²

$$V(r) = -\frac{e^2}{r}, \qquad (1)$$

our basic polar ψ's, as always in central fields, are of the form

$$(12^{78}) \qquad \psi(n\,l\,m_l) = R(n\,l)\,Y(l\,m_l); \qquad (2)$$

and the equation 11⁷⁸ satisfied by the function

$$(10^{78}) \qquad \chi = rR \qquad (3)$$

is now

$$\left[-\frac{1}{\kappa}\frac{d^2}{dr^2} + \frac{l(l+1)}{\kappa r^2} - \frac{e^2}{r}\right]\chi = E\chi. \qquad (4)$$

The χ-equation (4) has the form of a Schroedinger equation
for the linear motion of a particle in the potential field
$l(l+1)/\kappa r^2 - e^2/r$, indicated without regard for scale in Fig. 105;
and, by analogy with the one-dimensional case of conditional binding³³
shown in Fig. 48(a), we may expect that the energy spectrum
of our system consists of a continuous part extending upward from
E = 0 and a discrete part lying below E = 0. In proving this to be
the case, we shall consider separately the positive and the
negative values of E in (4). Incidentally, Fig. 105 suggests that
when l > 0 the particle is strongly repelled from the
origin in the vicinity of the origin; consequently we may expect
that for states other than s-states the ψ-functions should vanish
at O.

Fig. 105. Potential of the form $l(l+1)/\kappa r^2 - e^2/r$.

31 This e, having the value of approximately 4.8 × 10⁻¹⁰ electrostatic
units, is not to be confused with the base of natural logarithms.

32 Note that our V denotes the potential energy of the electron in the
field of the nucleus, and not the electrostatic potential due to the nucleus.

The case E > 0. If E is positive, the substitutions

$$\mu = \tfrac12 e^2\sqrt{\kappa/E}, \qquad \xi = 2\sqrt{\kappa E}\; r \qquad (5)$$

reduce (4) to

$$\frac{d^2\chi}{d\xi^2} + \left[\frac14 + \frac{\mu}{\xi} - \frac{l(l+1)}{\xi^2}\right]\chi(\xi) = 0, \qquad (6)$$

where μ is a positive parameter. To determine the possible positive
values of E we must find the positive values of μ for which (6)
has solutions that are well-behaved when divided by r, that is,
solutions that yield well-behaved R's.

The solutions of (6) can be expected to be free from ill behavior
except perhaps at the origin and at infinity, and hence we can
restrict our attention to these extremes. To determine the
behavior of χ at ξ = 0 we assume that

$$\chi(\xi) = a_r\xi^{r} + a_{r+1}\xi^{r+1} + \cdots, \qquad (7)$$

where $a_r \neq 0$, substitute (7) into the left side of (6), and find
that the term containing the lowest power of ξ in the result is
$a_r[r(r-1) - l(l+1)]\,\xi^{r-2}$. In order that this term vanish we
must have r(r − 1) = l(l + 1), that is, r = l + 1 or r = −l.


33 Such analogy is not always reliable; see, for example, the discussion
of the inverse-cube field by G. H. Shortley, Phys. Rev., 38, 120 (1931).





The power-series solutions of (6) near the origin thus have the
form

$$\chi(\xi) = a\,\xi^{\,l+1} + \text{higher powers of } \xi \qquad (8)$$

or the form

$$\chi(\xi) = b\,\xi^{-l} + \text{higher powers of } \xi, \qquad (9)$$

where a and b are arbitrary nonvanishing constants. Now, the
series (8), when divided by r, vanishes for r = 0 when l =
1, 2, ···, and has a finite nonvanishing value for r = 0 when
l = 0; this series thus always yields an R that is well-behaved at
the origin. Hence for every positive value of E the radial equation
has particular solutions which are well-behaved at the origin
and differ from one another by constant factors. The series (9),
on the other hand, is of no interest to us since, when divided
by r, it is ill-behaved at O even when l = 0.

For large values of ξ, Equation (6) reduces to the approximate
equation $\chi'' + \frac14\chi = 0$, so that its general solution has, for large
values of ξ, the form

$$\chi \approx A\,e^{i\xi/2} + B\,e^{-i\xi/2}. \qquad (10)$$

Dividing this result by r, we find that for every positive value of
E the general solution of the radial equation is well-behaved at
infinity.

Now, whatever the form of a solution of (6) may be near
ξ = 0, this solution will have the form (10) when ξ → ∞. In
particular, the solutions that have the form (8) for small ξ's will
have the form (10) for large ξ's. Since (8) is well-behaved for
small ξ's and (10) for large ξ's, and since solutions of the form (8)
exist for every positive E, we conclude that the radial equation
has well-behaved solutions for every positive value of E. And
it finally follows that an electron moving in the Coulomb potential
$-e^2/r$ may have any positive energy whatever.

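The case E < 0. If E is negative, the substitutions

$$\lambda = \tfrac12 e^2\sqrt{-\kappa/E}, \qquad \rho = 2\sqrt{-\kappa E}\; r \qquad (11a, b)$$

reduce (4) to

$$\frac{d^2\chi}{d\rho^2} + \left[-\frac14 + \frac{\lambda}{\rho} - \frac{l(l+1)}{\rho^2}\right]\chi(\rho) = 0. \qquad (12)$$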





For small values of ρ, Equation (12) reduces to the same form
as does (6) for small values of ξ, so that for every negative value
of E the radial equation has particular solutions that are
well-behaved at the origin and differ from one another by constant
factors.

For large values of ρ, Equation (12) reduces to the approximate
equation $\chi'' - \frac14\chi = 0$. This equation has the solution $e^{\rho/2}$,
which misbehaves when ρ → ∞, and the solution $e^{-\rho/2}$, which
vanishes when ρ → ∞; for large values of ρ, it also has the approximate
solutions $\rho\,e^{-\rho/2}$, $\rho^2 e^{-\rho/2}$, and so forth, which vanish when
ρ → ∞. Hence, for every negative value of E the radial equation
has particular solutions that are well-behaved at infinity and
particular solutions that are ill-behaved there.

It follows that, for a specific negative value of E, the particular 
solutions of the radial equation which are well-behaved at r = 0 
are either all well-behaved at infinity or all ill-behaved at infinity, 
and consequently we may expect quantization. 

To determine the precise values of the discrete energy levels,
we take up (12) anew and, taking the hint from the forms of its
solutions that are well-behaved at ρ = 0 and of its solutions that
are well-behaved when ρ → ∞, write χ in the form

$$\chi = \rho^{\,l+1}\, e^{-\rho/2}\, v(\rho), \qquad (13)$$

where v{p) is as yet undetermined. From this point the pro- 
cedure is quite similar to that which we used in the case of 3^, 
and we shall omit the details. When we express v as the power 
series 

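$$v(\rho) = a_r\rho^{r} + a_{r+1}\rho^{r+1} + a_{r+2}\rho^{r+2} + \cdots, \qquad (14)$$

with $a_r \neq 0$, it turns out that r = 0 or r = −2l − 1, that the
successive coefficients in (14) are related through the equations

$$\frac{a_{\nu+1}}{a_\nu} = \frac{\nu + l + 1 - \lambda}{(\nu+1)(\nu+2l+2)}, \qquad (15)$$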


34 Our discussion of the motion in a Coulomb field is quite brief; but the
problem is an important one, and the reader should at the first opportunity
study it in detail. See, for example, Pauling and Wilson, Chapter V;
and Slater and Frank, Chapter XXXIII. For a very detailed treatment,
see H. Bethe, "Quantenmechanik der Ein- und Zwei-Elektronenprobleme"
[hereafter cited as Bethe], Handbuch der Physik, second edition, Volume
XXIV-1; Berlin: Springer, 1933.





and that the solutions (14) are well-behaved if r = 0 and if v is a
terminating polynomial, and are ill-behaved otherwise. According
to (15), v terminates if and only if

$$\lambda = \nu + l + 1, \qquad \nu = 0, 1, 2, \cdots, \qquad (16)$$

that is, if and only if λ is an integer greater than zero, say

$$\lambda = n, \qquad n = 1, 2, 3, \cdots. \qquad (17)$$
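
The termination condition can be seen directly by iterating the recurrence. The sketch below assumes the recurrence in the form $a_{\nu+1}/a_\nu = (\nu + l + 1 - \lambda)/[(\nu+1)(\nu+2l+2)]$ with r = 0 and $a_0 = 1$; the function name and the cutoff `max_terms` are illustrative.

```python
from fractions import Fraction

def v_coefficients(lam, l, max_terms=12):
    """Coefficients a_0, a_1, ... of v(rho), generated with a_0 = 1.

    Stops early if a coefficient vanishes (the series then terminates);
    otherwise returns the first max_terms coefficients.
    """
    coeffs = [Fraction(1)]
    for nu in range(max_terms - 1):
        ratio = Fraction(nu + l + 1) - Fraction(lam)
        ratio /= (nu + 1) * (nu + 2 * l + 2)
        nxt = coeffs[-1] * ratio
        if nxt == 0:
            break
        coeffs.append(nxt)
    return coeffs

print(v_coefficients(2, 0))                    # integer lambda: polynomial
print(v_coefficients(3, 0))
print(len(v_coefficients(Fraction(5, 2), 0)))  # non-integer lambda: no termination
```

With λ an integer n > l the list has n − l entries, that is, v is a polynomial of degree n − l − 1; for non-integer λ the series does not stop.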


Using (11a), we now find that the discrete energy spectrum of an
electron moving in the Coulomb potential $-e^2/r$ consists of the
following levels:

$$E_n = -\frac{2\pi^2 m e^4}{n^2 h^2}, \qquad n = 1, 2, 3, \cdots. \qquad (18)$$


This celebrated result was first obtained by Bohr in 1913 and 
marked the first triumph of his atomic theory. 
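
To get a feeling for the magnitudes involved, the Bohr formula $E_n = -2\pi^2 m e^4/n^2h^2$ can be evaluated in CGS units. The numerical constants below are rounded modern values supplied as assumptions; they are not taken from the text.

```python
import math

# Rounded CGS values, assumed for illustration only
M_E = 9.109e-28        # electron mass, g
E_CHARGE = 4.803e-10   # electron charge, electrostatic units
H_PLANCK = 6.626e-27   # Planck's constant h, erg s
ERG_PER_EV = 1.602e-12

def bohr_level(n):
    """E_n = -2 pi^2 m e^4 / (n^2 h^2), in erg."""
    return -2 * math.pi**2 * M_E * E_CHARGE**4 / (n**2 * H_PLANCK**2)

for n in (1, 2, 3):
    print(n, bohr_level(n) / ERG_PER_EV)   # energies in electron volts
```

The lowest level comes out near −13.6 electron volts, and the levels scale as 1/n².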

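The polynomial v(ρ), starting with a term in ρ⁰ and terminating with a
term in $\rho^{\nu}$, that is, in $\rho^{\,n-l-1}$, turns out to be, apart from a constant factor,
the polynomial

$$\frac{d^{\,2l+1}}{d\rho^{\,2l+1}}\left[e^{\rho}\,\frac{d^{\,n+l}}{d\rho^{\,n+l}}\left(\rho^{\,n+l}e^{-\rho}\right)\right]. \qquad (19)$$

The polynomial $e^{x}\,\frac{d^k}{dx^k}(x^k e^{-x})$ is denoted by $L_k(x)$ and is called the Laguerre
polynomial of the kth degree; the mth derivative of $L_k(x)$ with respect to x
is denoted by $L_k^m(x)$, so that the polynomial (19) is denoted by $L_{n+l}^{2l+1}(\rho)$.
Thus the χ's yielding well-behaved R's, if we omit the arbitrary constants,
are

$$\chi(n\,l) = \rho^{\,l+1}\, e^{-\rho/2}\, L_{n+l}^{2l+1}(\rho). \qquad (20)$$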

Introducing the constant

$$a_0 = \frac{\hbar^2}{m e^2}, \qquad (21)$$

called the radius of the first Bohr orbit in hydrogen, and recalling (11b) and
(18), we find that the independent variable in (20) is

$$\rho = \frac{2}{n a_0}\, r. \qquad (22)$$

Dividing (20) by r, we get R(n l); the result, when normalized in such
a way that $\int_0^\infty \bar{R}(n\,l)\,R(n\,l)\,r^2\,dr = 1$, is

$$R(n\,l) = e^{i\gamma}\, N(n\,l)\, \rho^{\,l}\, e^{-\rho/2}\, L_{n+l}^{2l+1}(\rho), \qquad (23)$$



where γ [strictly, $\gamma_{nl}$] is an arbitrary real constant, and where

$$N(n\,l) = \sqrt{\frac{4\,(n-l-1)!}{a_0^3\, n^4\, [(n+l)!]^3}}. \qquad (24)$$


The normalized basic ψ's specifying instantaneously the discrete energy
states are consequently

$$\psi(n\,l\,m_l) = e^{i\gamma}\, N(n\,l)\, \rho^{\,l}\, e^{-\rho/2}\, L_{n+l}^{2l+1}(\rho)\, Y(l\,m_l), \qquad (25)$$

where the γ's [strictly, the γ(n l m_l)'s] are arbitrary real constants. In
particular, the instantaneous normalized ψ-function for the normal state
(that is, the state with n = 1, l = 0, and m_l = 0) is

$$\psi(100) = e^{i\gamma}\,(\pi a_0^3)^{-1/2}\, e^{-r/a_0}. \qquad (26)$$
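
The normalization of the normal-state function can be confirmed by a direct numerical integration. The sketch below takes $|\psi(100)|^2 = e^{-2r/a_0}/\pi a_0^3$ and works in units with $a_0 = 1$; the Simpson-rule helper and its parameters are illustrative choices.

```python
import math

def psi100_sq(r, a0=1.0):
    """|psi(100)|^2 for the normal state; the phase factor drops out."""
    return math.exp(-2 * r / a0) / (math.pi * a0**3)

def radial_integral(f, r_max=40.0, steps=4000):
    """Composite Simpson rule for the integral of f(r) * 4 pi r^2 from 0 to r_max."""
    h = r_max / steps
    total = 0.0
    for i in range(steps + 1):
        r = i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * f(r) * 4 * math.pi * r**2
    return total * h / 3

norm = radial_integral(psi100_sq)
mean_r = radial_integral(lambda r: r * psi100_sq(r))
print(norm, mean_r)   # total probability and average r, in units of a0
```

The first number should be close to 1, and the second gives the average distance of the electron from the origin in units of $a_0$.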


Degeneracy of the discrete spectrum. Since (12) has
satisfactory solutions only for particular values of λ, and since it
involves from the outset the azimuthal quantum number l, we
may expect that its satisfactory solutions and also the possible
values of E would involve at least two quantum numbers, one of
which is l. As shown by (25), this is true insofar as the ψ-functions
are concerned; but, as shown by (18), this is not true for the
energy levels: the energy levels depend only on the principal
quantum number n, and are degenerate with respect to the
azimuthal quantum number l. This 'accidental' degeneracy means
that the degeneracy of the discrete energy spectrum of a hydrogen
atom is greater than the minimum degeneracy pointed out in
italics on page 451.

According to (16) and (17),

$$n \geq l + 1, \qquad (27)$$

so that the following n values of l are compatible with a prescribed
value n of the principal quantum number:

$$l = 0, 1, 2, \cdots, (n - 1). \qquad (28)$$

Now, there are 2l + 1 values of m_l which are compatible with a
prescribed value l of the azimuthal quantum number, and therefore
there are

$$[2\cdot 0 + 1] + [2\cdot 1 + 1] + [2\cdot 2 + 1] + \cdots + [2\cdot(n-1) + 1] = n^2 \qquad (29)$$





distinct pairs of values of l and m_l which are compatible with a
prescribed value n of the principal quantum number. And, since
the energy formula (18) involves only n, it follows that in the
Coulomb case the degeneracy of the nth quantum state is n²-fold.
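
The count of states can be confirmed by brute-force enumeration. This is only an illustration of the counting argument; the function name is arbitrary.

```python
def states(n):
    """All pairs (l, m_l) with l = 0, ..., n-1 and m_l = -l, ..., l."""
    return [(l, m) for l in range(n) for m in range(-l, l + 1)]

for n in range(1, 6):
    print(n, len(states(n)), n * n)   # the last two columns agree
```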

The continuous energy spectrum of the hydrogen atom (that is, 
the spectrum consisting of all positive values of E) is also de- 
generate. But a discussion of this degeneracy lies outside the 
scope of this book. 

Concluding remarks. A particle moving classically in a central
field remains in a plane that is perpendicular to the orbital angular
momentum L; consequently, should we expect the motion to
exhibit any symmetry at all, this symmetry would be with respect
to an axis having the direction of L and thus inclined to the
z-axis at an angle whose cosine is $L_z/L$. To determine the
direction of L it is of course necessary to know all the three quantities
$L_x$, $L_y$, and $L_z$. Now, quantum mechanically, the knowledge of
$L_x$ and $L_y$ is incompatible with that of $L_z$ (except for s-states), and
consequently, quantum mechanically, the direction of L is never
known. For states for which both $L_z$ and L are known and
L ≠ 0, we may, however, picture L as lying somewhere on a cone
that makes with the z-axis the angle whose cosine is $L_z/L$. It
then follows that states with known $L_z$ and L, if they should
exhibit any symmetry at all, should exhibit it with respect to the
z-axis. This is indeed the case: note, recalling 12⁷⁸, that $\bar\psi\psi$ for our
basic ψ's is independent of φ and, recalling Exercise 2⁷⁸, that the
probability currents associated with the basic ψ's are also
independent of φ.

A particle moving classically in a central field may remain in
the equatorial plane (the xy-plane); this case arises when L is
directed along the z-axis, so that $L_z$ has the largest value
consistent with the value of L, that is, the value $L_z = L$. According
to quantum mechanics, the largest value of $L_z$ consistent with the
value $\sqrt{l(l+1)}\,\hbar$ of L is $l\hbar$, and consequently the smallest angle
that L can make with the z-axis has the cosine $l/\sqrt{l(l+1)}$.
This cosine approaches 1 for large values of l, and we may
therefore expect by classical analogy that, for a state having a large l
and having as large an m_l as is consistent with this l, namely,
m_l = l, the particle should be very likely to be near the equatorial
plane. It will be shown in Exercise 5 that this is so.

If L = 0, the classical motion of a particle in a central field 





takes place along a straight line, passing through O, whose
direction can be ascertained as soon as the initial conditions of the
motion are known; but if all we know is the energy of the system
and the fact that L = 0, then the direction of this line is entirely
indeterminate, and hence the distribution-in-position of the
particle is spherically symmetric about O. Since states for which
L = 0, that is, s-states, are described by spherically symmetric
ψ's, the situation is quite analogous in quantum mechanics.

For the basic polar states, the possible orientations of L with
respect to the z-axis (not the orientations of L in space, which are
never known) can be pictured by vector diagrams. Thus the first
diagram of Fig. 106, which refers to the case l = 1, shows the vector L
of length $L = \sqrt{1\cdot 2}\;\hbar$ in the three positions corresponding to the
respective possible values −ħ, 0, and ħ of its z-component $L_z$;
the second diagram refers in a similar way to d-states.

Fig. 106. Vector diagrams for L (p-states and d-states).

Exercises

1. Consider the normal state (26) of a hydrogen atom and show that:
(a) the most probable value of r for this state is $a_0$; (b) $\mathrm{av}_{100}$ …;
(c) $\mathrm{av}_{100}\, r^{-2} = 2a_0^{-2}$; (d) if k = 1, 2, 3, ···, then
$\mathrm{av}_{100}\, r^{k} = (k+2)!\,a_0^{k}/2^{k+1}$; (e) $\mathrm{av}_{100}\, p^2 = \hbar^2/a_0^2$;
(f) $\mathrm{av}_{100}\, V(r) = -e^2/a_0$; (g) the average potential
energy equals twice the total energy; (h) the average kinetic energy equals
minus the total energy.

2. The state of a hydrogen atom is specified at t = 0 by the ψ-function
(8 − ρ sin θ …; compute the distribution-in-E, the distribution-in-L,
and the distribution-in-$L_z$; and set up the time-dependent ψ.

3. An electron moves in the fixed Coulomb field $Ze^2/r^2$, as though a
nucleus of charge Ze were permanently located at O. Show by inspection
that the energy levels of this system³⁵ and the various ψ's pertaining to it
can be obtained from the corresponding results of this section by merely
replacing e² by Ze²; this means, in particular, that, in results expressed
entirely in terms of $a_0$, $a_0$ is to be replaced by $a_0/Z$.


35 This system is of interest in connection with the so-called hydrogenic
or hydrogen-like atoms which, like the deuterium atom, the singly ionized
helium atom, and so forth, consist of a nucleus and a single electron.

4. An electron moving in the fixed Coulomb field of Exercise 3 is
in the normal state. Show that then the average electrostatic
potential in space due to both the fixed field and the field of the electron itself
is $e(Z-1)/r + (eZ/a_0 + e/r)\,e^{-2Zr/a_0}$.

5. Recall 78^^ and show that for a basic polar state with l large and
with m_l = l the particle is very likely to be near the equatorial plane.

6. Note that the motion of a spatial isotropic harmonic oscillator is
central, and compute the energy levels 52^® using the constants of motion
H, L, and $L_z$, that is, using polar quantization.

7. Show that the functions i2T5 , and 72T} of Exercise 10’*^ are 

the basic polar of the isotropic oscillator belonging to the first excited 
state. 

8. Show on physical grounds that, if a basic Cartesian of an isotropic 
oscillator (that is, a of the form 48^®) belonging to the nth energy level 
is expanded in terms of the basic polar i/'s of the oscillator (that is, ip's 
of the form 12^®), then only the polar i/*s belonging to the nth level wdl 

^ appear in the expansion. The expansions of Exercise illustrate this 
result, the converse of which is of course also true. 

81. The Bohr Formula for the Hydrogen Atom; Free Nucleus 

We now proceed to remove the physically unsatisfactory
condition that the nucleus of our hydrogen atom be held fixed, and turn
to the problem of an electron (mass m, charge −e) and a proton
(mass M, charge e) moving nonrelativistically in each other's
field and free from external influences.

As we have seen in a number of instances, the quantum-mechanical
study of a dynamical system is made quite systematic
as soon as the constants of motion of the system are identified.
And since the constants of motion of a classical system, which are
usually identified rather easily, are certain to be constants of
motion of a corresponding nonrelativistic quantum-mechanical
system, we begin with a few remarks on the classical theory of
our present system.

The coordinates and the components of the linear momentum
of the electron will carry the subscript 1, those of the proton the
subscript 2. The distance between the two particles is then
$[(x_1-x_2)^2 + (y_1-y_2)^2 + (z_1-z_2)^2]^{1/2}$; their mutual electric
potential energy when they are at this distance apart is
$-e^2[(x_1-x_2)^2 + (y_1-y_2)^2 + (z_1-z_2)^2]^{-1/2}$; and the Hamiltonian
of the system is

$$H = \frac{1}{2m}\left(p_{x_1}^2 + p_{y_1}^2 + p_{z_1}^2\right) + \frac{1}{2M}\left(p_{x_2}^2 + p_{y_2}^2 + p_{z_2}^2\right) - \frac{e^2}{\sqrt{(x_1-x_2)^2 + (y_1-y_2)^2 + (z_1-z_2)^2}}. \qquad (1)$$

Since the forces exerted by the proton on the electron, and
conversely, are directed along the straight line joining the proton and
the electron, and since there are no external forces, it is perhaps
obvious that the center of gravity of the two particles, whose
coordinates are

$$X = \frac{mx_1 + Mx_2}{m+M}, \qquad Y = \frac{my_1 + My_2}{m+M}, \qquad Z = \frac{mz_1 + Mz_2}{m+M}, \qquad (2)$$

will move uniformly in a straight line, while the electron and the
proton revolve about their center of gravity as though this center
were at rest. Hence we may expect a simplification if we replace
the six coordinates of the electron and proton by the three
coordinates (2) and by three additional internal coordinates describing
the internal configuration of the system; for the latter we choose
the three coordinates

$$x = x_1 - x_2, \qquad y = y_1 - y_2, \qquad z = z_1 - z_2 \qquad (3)$$

of the electron with respect to the proton.

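Rewriting (1) in terms of the new variables (2) and (3) and
the quantities

$$P_x = (m+M)\dot{X}, \qquad P_y = (m+M)\dot{Y}, \qquad P_z = (m+M)\dot{Z} \qquad (4)$$

and

$$p_x = \mu\dot{x}, \qquad p_y = \mu\dot{y}, \qquad p_z = \mu\dot{z}, \qquad (5)$$

where μ, defined as

$$\mu = \frac{mM}{m+M}, \qquad (6)$$

is called the reduced mass of the system, we get

$$H = H^0 + H', \qquad (7)$$

where

$$H^0 = \frac{1}{2(m+M)}\left(P_x^2 + P_y^2 + P_z^2\right) \qquad (8)$$

and

$$H' = \frac{1}{2\mu}\left(p_x^2 + p_y^2 + p_z^2\right) - \frac{e^2}{r}, \qquad (9)$$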

r being the distance between the electron and the proton.

The term $H^0$ in (7) refers to the motion of the center of gravity
and is the Hamiltonian of a free particle of mass m + M; the
term $H'$ refers to the internal motion and is the Hamiltonian of a
particle of mass μ moving in a fixed Coulomb field with $V = -e^2/r$;
$H^0$ and $H'$ have no variables in common, so that the motion of the
center of gravity and the internal motion are independent of one
another. The total energy E of the system has the form

$$E = E^0 + E'; \qquad (10)$$

here $E^0$ is the translational kinetic energy
$(P_x^2 + P_y^2 + P_z^2)/2(m+M)$, computed as though the total mass m + M were
concentrated at the center of gravity; and $E'$ is the internal
energy, that is, the energy of the electron and the proton moving
under each other's influence about their center of gravity as
though this center were at rest, or, if we use the particular internal
coordinates (3), the energy of a particle of reduced mass μ and
charge −e moving about a fixed nucleus of charge e.
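
The splitting of the kinetic energy into a center-of-gravity part and an internal part is a purely algebraic identity, and it can be spot-checked numerically. The sketch below uses arbitrary sample masses and velocities (assumptions for illustration) and the definitions of the center-of-gravity coordinates, the relative coordinates, and the reduced mass μ = mM/(m + M).

```python
def kinetic_two_ways(m, M, v1, v2):
    """Return (direct, separated) kinetic energies for velocities v1 and v2.

    direct    : (1/2) m v1^2 + (1/2) M v2^2
    separated : P^2 / 2(m + M) + p^2 / 2 mu
    """
    direct = 0.5 * m * sum(c * c for c in v1) + 0.5 * M * sum(c * c for c in v2)
    mu = m * M / (m + M)                                         # reduced mass
    v_cm = [(m * a + M * b) / (m + M) for a, b in zip(v1, v2)]   # velocity of center of gravity
    v_rel = [a - b for a, b in zip(v1, v2)]                      # relative velocity
    P = [(m + M) * c for c in v_cm]                              # total momentum
    p = [mu * c for c in v_rel]                                  # internal momentum
    separated = sum(c * c for c in P) / (2 * (m + M)) + sum(c * c for c in p) / (2 * mu)
    return direct, separated

direct, separated = kinetic_two_ways(1.0, 1836.0, (0.3, -1.2, 2.0), (0.01, 0.02, -0.03))
print(direct, separated)   # the two values agree to rounding error
```

Because the potential energy depends only on the relative coordinates, the same splitting carries over to the full Hamiltonian.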

Insofar as the motion of the center of gravity is concerned, the
quantities $P_x$, $P_y$, and $P_z$ are constants, and since this motion
is independent of the internal motion the P's are constants of
motion of the complete system; similarly, the constants of the
internal motion are also constants of motion of the complete
system.

Schroedinger theory. In the present problem we encounter
for the first time the task of identifying the Schroedinger operators
to be associated with the coordinates and the components of
linear momentum of two interacting particles. The procedure
(which can be justified by a study of Poisson brackets relating to a
system containing two particles) is a direct extension of that for
the case of a single particle: with the variables $x_1$, $y_1$, $z_1$, $p_{x_1}$,
$p_{y_1}$, and $p_{z_1}$ pertaining to the electron we associate the respective
operators $x_1$, $y_1$, $z_1$, $-i\hbar\,\partial/\partial x_1$, $-i\hbar\,\partial/\partial y_1$, and $-i\hbar\,\partial/\partial z_1$; and
with the variables $x_2$, $y_2$, $z_2$, $p_{x_2}$, $p_{y_2}$, $p_{z_2}$ pertaining to the
proton we associate the respective operators $x_2$, $y_2$, $z_2$, $-i\hbar\,\partial/\partial x_2$,
$-i\hbar\,\partial/\partial y_2$, and $-i\hbar\,\partial/\partial z_2$. The operands of these operators are
well-behaved functions of the six variables $x_1$, $y_1$, $z_1$, $x_2$, $y_2$,
and $z_2$; good behavior³⁶ is now defined by a direct generalization
of the definition of §74.

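The Schroedinger operator associated with the Hamiltonian
(1) is

$$H = -\frac{\hbar^2}{2m}\left(\frac{\partial^2}{\partial x_1^2} + \frac{\partial^2}{\partial y_1^2} + \frac{\partial^2}{\partial z_1^2}\right) - \frac{\hbar^2}{2M}\left(\frac{\partial^2}{\partial x_2^2} + \frac{\partial^2}{\partial y_2^2} + \frac{\partial^2}{\partial z_2^2}\right) - \frac{e^2}{\sqrt{(x_1-x_2)^2 + (y_1-y_2)^2 + (z_1-z_2)^2}}. \qquad (11)$$

The classical procedure suggests that we introduce the new
coordinates defined by (2) and (3), and the new momentum operators

$$P_x = -i\hbar\frac{\partial}{\partial X}, \qquad P_y = -i\hbar\frac{\partial}{\partial Y}, \qquad P_z = -i\hbar\frac{\partial}{\partial Z} \qquad (12)$$

and

$$p_x = -i\hbar\frac{\partial}{\partial x}, \qquad p_y = -i\hbar\frac{\partial}{\partial y}, \qquad p_z = -i\hbar\frac{\partial}{\partial z}. \qquad (13)$$

In terms of these, the operator (11) becomes

$$H = H^0 + H', \qquad (14)$$

where

$$H^0 = -\frac{\hbar^2}{2(m+M)}\left(\frac{\partial^2}{\partial X^2} + \frac{\partial^2}{\partial Y^2} + \frac{\partial^2}{\partial Z^2}\right) \qquad (15)$$

and

$$H' = -\frac{\hbar^2}{2\mu}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right) - \frac{e^2}{r}. \qquad (16)$$


36 It should perhaps be emphasized that the two particles under
consideration here (electron and proton) are distinguishable. In the case of
identical particles, the ψ-functions must not only be well-behaved but
must also satisfy certain symmetry conditions (to put it roughly, the
additional conditions take care of the experimental impossibility of
detecting an interchange of two indistinguishable particles); this extremely
important point lies outside the scope of this book.

Incidentally, in the case of two particles, the configuration space
referred to in the general Schroedinger expectation formula is a fictitious
6-dimensional space; its axes refer to the six coordinates $x_1$, $y_1$, $z_1$, $x_2$,
$y_2$, and $z_2$, so that every configuration of the two particles in 3-space is
represented by a single point in the configuration space.


The commuting dynamical variables

$$H, \quad P_x, \quad P_y, \quad P_z \qquad (17)$$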

are among the constants of motion of our system, and we begin
by determining the form of their simultaneous eigenfunctions.
The simultaneous eigenfunctions of the P's are

$$\psi = u(x, y, z)\, e^{i(P'_x X + P'_y Y + P'_z Z)/\hbar}, \qquad (18)$$

where the P′'s are arbitrary real constants and u(x, y, z) is an
arbitrary well-behaved function of the internal coordinates. Now,
if we operate on (18) with H and write

$$E^0 = \frac{P'^{\,2}_x + P'^{\,2}_y + P'^{\,2}_z}{2(m+M)}, \qquad (19)$$

we get

$$H\psi = H^0\psi + H'\psi = E^0\psi + H'\psi, \qquad (20)$$

so that if (18) is to satisfy the Schroedinger equation Hψ = Eψ
we must have

$$E^0\psi + H'\psi = E\psi. \qquad (21)$$

When (18) is explicitly substituted into (21) the exponential
factor cancels out, and we are left with

$$E^0 u(x, y, z) + H' u(x, y, z) = E\,u(x, y, z). \qquad (22)$$

If we finally write

$$E - E^0 = E', \qquad (23)$$

Equation (22) becomes

$$H' u(x, y, z) = E' u(x, y, z). \qquad (24)$$

We therefore conclude that the simultaneous eigenfunctions of
the variables (17) have the form (18) where u(x, y, z) is a
well-behaved function satisfying (24), and (since the values of the
P′'s remain unrestricted) that the quantity $E^0$, which we may
call the translational energy of our system, is not quantized.





Now, in view of (16), Equation (24) is just the Schroedinger
equation for a single particle of mass μ moving in the fixed Coulomb
potential $-e^2/r$, and consequently the possible values of the
parameter $E'$ in (24), that is, of the internal energy of our system,
can be obtained from the results of the preceding section by
writing μ for m. The conclusion is that all positive values of the
internal energy are possible, while its possible negative values are

$$E'_n = -\frac{2\pi^2\mu e^4}{n^2h^2}, \qquad n = 1, 2, 3, \cdots. \qquad (25)$$
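
Since the internal levels are obtained from the fixed-nucleus levels by writing μ for m, every level is multiplied by the factor μ/m = M/(m + M). The sketch below evaluates this factor for hydrogen; the proton-to-electron mass ratio is a rounded value supplied as an assumption, not taken from the text.

```python
# Assumed rounded mass ratio (proton mass / electron mass), for illustration only
PROTON_TO_ELECTRON = 1836.15

def reduced_mass_factor(nucleus_to_electron):
    """mu / m = M / (m + M), written in terms of the ratio M / m."""
    return nucleus_to_electron / (1.0 + nucleus_to_electron)

factor = reduced_mass_factor(PROTON_TO_ELECTRON)
print(factor, 1.0 - factor)   # scaling of every level, and the fractional shift
```

The correction is small, a few parts in ten thousand, but it differs from one nucleus to another because it depends on M.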


Exercise 

1. Compare the energy levels of the internal motion of the hydrogen 
atom, the deuterium atom, and a singly ionized helium atom of mass 4. 

82. The Fine Structure of the Hydrogen Levels

When the spectrum of atomic hydrogen is studied with apparatus
of moderate resolving power, the discrete energy levels are found
to be just those given by the Bohr formula

$$(18^{80}) \qquad E_n = -2\pi^2 m e^4/n^2 h^2, \qquad (1)$$

provided this formula is corrected for nuclear motion,³⁷ as we
have done in §81. But when high resolution is employed it is
found that the levels which the Bohr formula requires to be
singlets are, in fact, with the exception of the lowest level, very
narrow multiplets.

The fine structure of the lower portion of the hydrogen energy
spectrum is shown schematically in³⁸ Fig. 107. The long thin
lines mark the positions of the singlet levels required by the Bohr
formula; these levels are shown as equally spaced, although their
separations, in fact, decrease with increasing n. The short thick
lines show the actual levels. The displacement of each member
of the nth multiplet below the nth Bohr level, measured in terms
of the unit

$$m e^8/2c^2\hbar^4 n^4, \qquad (2)$$


37 In the remainder of our work with the hydrogen atom we shall
disregard the corrections for nuclear motion.

38 By permission of The Cambridge University Press and The Macmillan
Company, publishers, our energy-level diagrams for hydrogen are
patterned after Fig. 2⁵ of Condon and Shortley.





is indicated by the number at the right of the corresponding short
line. Since the unit (2) decreases with n, the diagram exaggerates
the widths of the higher multiplets; it also exaggerates the
separations of the members of a multiplet compared to the
separations of the Bohr levels.

Fig. 107. Fine structure of hydrogen levels.

The formula that gives the levels correctly was obtained in 1915
by Sommerfeld,³⁰ who extended Bohr's quantization rules to an
electron moving in a Coulomb field according to the laws of
Einstein's theory of relativity, rather than the laws of Newtonian
mechanics. Sommerfeld's fine-structure formula is

$$E_{nk} = mc^2\left[1 + \frac{\alpha^2}{\left(n - k + \sqrt{k^2 - \alpha^2}\right)^2}\right]^{-1/2}, \tag{3}$$

where the principal quantum number n takes on the values

$$n = 1, 2, 3, \cdots, \tag{4}$$


and the auxiliary quantum number k takes on, for a prescribed
value of n, the n values

$$k = 1, 2, \cdots, n. \tag{5}$$

The symbol c denotes the speed of light, while $\alpha$, defined as

$$\alpha = \frac{e^2}{c\hbar} \tag{6}$$

and called the Sommerfeld fine-structure constant, is a pure number
whose value is approximately 1/137.

Since $\alpha^2$ is very small compared to $k^2$, the term $-k$ in (3) is
almost canceled by the radical following this term, and therefore
the levels having the same n but different k's lie very close together
compared with levels having different n's; the formula (3) thus
describes a series of narrow multiplets. To reveal their structure
more clearly we expand the right side of (3) into a power series


³⁰ A. Sommerfeld, Annalen der Physik, 51, 1 (1916).





in $\alpha$ and then drop the terms containing $1/c^4$, $1/c^6$, and so on.
The resulting highly accurate approximation is

$$E_{nk} = E_n - \frac{\alpha^2 |E_n|}{4n^2}\left(\frac{4n}{k} - 3\right) + mc^2, \tag{7}$$

where $E_n$ is given by (1) and the factor $\alpha^2|E_n|/4n^2$ is just the
quantity (2). The term $mc^2$ in (7) is called the relativistic rest
energy of the electron.
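The accuracy of the approximation can be checked numerically. The sketch below evaluates Sommerfeld's closed formula and its expansion, both with the rest energy removed and in units of $mc^2$; the numerical value of $\alpha$ and the function names are assumptions made for illustration.

```python
import math

ALPHA = 1 / 137.036  # ASSUMPTION: numerical value of the fine-structure constant

def sommerfeld_exact(n, k, alpha=ALPHA):
    """Closed fine-structure formula minus the rest energy, in units of m c^2."""
    denom = n - k + math.sqrt(k * k - alpha * alpha)
    return (1.0 + (alpha / denom) ** 2) ** -0.5 - 1.0

def sommerfeld_approx(n, k, alpha=ALPHA):
    """Expansion through order alpha^4, minus the rest energy, in units of m c^2."""
    bohr = -alpha ** 2 / (2 * n ** 2)               # Bohr energy E_n / (m c^2)
    unit = alpha ** 2 * abs(bohr) / (4 * n ** 2)    # the unit alpha^2 |E_n| / 4n^2
    return bohr - unit * (4 * n / k - 3)

for n in range(1, 4):
    for k in range(1, n + 1):
        exact = sommerfeld_exact(n, k)
        approx = sommerfeld_approx(n, k)
        print(n, k, exact, approx, exact - approx)
```

The differences printed in the last column are of order $\alpha^6$, far smaller than the fine-structure splittings themselves, which are of order $\alpha^4$.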

The levels computed by means of (7), with the rest energy left
out, and their Sommerfeld classification in terms of the quantum
numbers n and k are shown on the right side of Fig. 108; the left
part of this figure is a reproduction of Fig. 107 (Fig. 107 has, in
fact, been computed from Equation (7) rather than from experimental
data).

Fig. 108. Fine-structure levels of hydrogen, and their Sommerfeld
classification.
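The numbers printed beside the short lines of the diagrams can be generated directly. According to the approximate formula, the level (n, k) lies below the nth Bohr level by $4n/k - 3$ times the unit $\alpha^2|E_n|/4n^2$; the sketch below tabulates this factor with exact fractions (the function name is hypothetical).

```python
from fractions import Fraction

def displacement(n, k):
    """Displacement below the nth Bohr level, in units of alpha^2 |E_n| / 4n^2."""
    return Fraction(4 * n, k) - 3

table = {n: [displacement(n, k) for k in range(1, n + 1)] for n in range(1, 5)}
for n, row in table.items():
    print(n, [str(d) for d in row])
```

Raising the upper limit of `range(1, 5)` extends the table to higher n.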

So far, our quantum-mechanical computations concerning the
hydrogen atom have led to the Bohr formula; but since Sommerfeld's,
rather than Bohr's, formula agrees with experiment, we should expect
the theory to yield Sommerfeld's formula or one much like it. Our
study is thus as yet incomplete; and, guided by the fact that
Sommerfeld's computations invoked the theory of relativity, we
proceed to investigate the relativity corrections to our
quantum-mechanical result.


Exercises

1. Deduce (7) from (3).

2. Extend Fig. 108 through n = 5.

83. Relativity Corrections for Hydrogen 

In relativity theory, the Newtonian relation
$E - V = (p_x^2 + p_y^2 + p_z^2)/2m$ between the kinetic energy $E - V$
and the Cartesian components $p_x$, $p_y$, and $p_z$ of the linear
momentum of a particle is replaced by the relation

$$\frac{1}{c^2}(E - V)^2 = p_x^2 + p_y^2 + p_z^2 + m^2c^2, \tag{1}$$

where c is the speed of light and m is the so-called rest mass of the
particle, that is, mass as determined by an observer with respect
to whom the particle is at rest. In the case of a free particle at
rest, or, more precisely, in the case $V = 0$ and $p_x = p_y = p_z = 0$,
Equation (1) reduces to $E^2 = m^2c^4$ and yields for the energy of
the particle the positive value

$$E_0 = mc^2 \tag{2}$$

and the negative value $-mc^2$. The second alternative is of no
interest in classical relativity, while (2) is interpreted to mean
that, even when at rest, the particle possesses a mass energy or
rest energy of the amount $mc^2$. According to relativity theory,
a Cartesian component of the linear momentum of a particle may
have any value between $-\infty$ and $\infty$, while the speed of the
particle cannot exceed the speed of light c; hence the relativistic
relation ($p_x = m\dot{x}/\sqrt{1 - v^2/c^2}$, and so on) between linear
momentum and velocity, which we need not discuss here, is more
complicated than the corresponding Newtonian relation.

The fundamental problem of the relativistic quantum me- 
chanics of a single particle, namely, the construction of the 
operator H that should replace the Schroedinger Hamiltonian 
operator when the Newtonian energy equation is replaced by the 
relativistic equation (1), will be taken up in §91. For the present 
we shall employ a somewhat indirect procedure that avoids the
use of the explicit form of the operator H; critical comments on
this procedure will be made in §91.

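We consider a state $\psi$ for which the total energy of the system
is certainly E, rewrite (1) as

$$\frac{1}{c^2}(E - V)^2 - p_x^2 - p_y^2 - p_z^2 = m^2c^2, \tag{3}$$

and find that the Schroedinger operator associated with the
quantity on the left side of (3) is

$$\frac{1}{c^2}(E - V)^2 + \hbar^2\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right), \tag{4}$$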





where E is the numerical value of the total energy and V is the
Schroedinger operator associated with the potential energy. Now,
according to (3), the quantity on the left side of (3) is simply the
number $m^2c^2$; we interpret this fact to mean quantum mechanically
that the Schroedinger function $\psi$ describing an energy state of
energy E is an eigenfunction of the operator (4) belonging to the
eigenvalue $m^2c^2$ of (4), and write

$$\left[\frac{1}{c^2}(E - V)^2 + \hbar^2\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right)\right]\psi = m^2c^2\psi. \tag{5}$$


The energy levels of the system can now be found by computing 
the numerical values of E for which (5) has well-behaved solutions. 
Recalling the steps from to 3^®, we may rewrite (5) as 




$$\left[-p_r^2 - \frac{1}{r^2}L^2 + \frac{(E - V)^2}{c^2}\right]\psi = m^2c^2\psi, \tag{6}$$


where and pr are the operators 65^® and 24^^ 

Let us now consider central fields, when V in (6) has the
form V(r). The straightforward mathematical procedure for
studying (6) in this case is to try separating the variables by
assuming that $\psi$ has the form $R(r)\Theta(\theta)\Phi(\phi)$. We can, however,
skip a step or two because nonrelativistic analogy suggests that
$L_z$ and $L^2$ are constants of motion in our case, and that hence,
according to the expansion theorem, we may restrict ourselves at
once to $\psi$'s of the more specific form

$$\psi = R\,Y(lm_l). \tag{7}$$


Substituting (7) into (6), where now V = V{r), and using 15^, 
we find that the angle factors cancel out and that we are left with 
the radial equation 


$$\left[-p_r^2 - \frac{l(l+1)\hbar^2}{r^2} + \frac{(E - V)^2}{c^2}\right]R = m^2c^2R. \tag{8}$$


The cancellation of the angle factors provides a mathematical 
justification for the form (7). 

When we finally restrict ourselves to the Coulomb case

$$(1^{80})\qquad V = -e^2/r \tag{9}$$

and make our standard substitution

$$\chi = rR, \tag{10}$$

Equation (8) reduces to

$$\left\{-\frac{1}{\kappa}\frac{d^2}{dr^2} + \left[\frac{l(l+1)}{\kappa} - \frac{e^4}{2mc^2}\right]\frac{1}{r^2} - \frac{E}{mc^2}\frac{e^2}{r}\right\}\chi = \frac{E^2 - m^2c^4}{2mc^2}\,\chi. \tag{11}$$


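Apart from the differences in the numerical coefficients, our
new $\chi$-equation (11) is of the same form as the nonrelativistic
$\chi$-equation

$$(4^{80})\qquad \left\{-\frac{1}{\kappa}\frac{d^2}{dr^2} + \frac{l(l+1)}{\kappa}\frac{1}{r^2} - \frac{e^2}{r}\right\}\chi = E\chi. \tag{12}$$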


Hence the precise values of E for which (11) has satisfactory
solutions can be found in essentially the same way as in the case of
(12), and we may omit details.³¹ It turns out that the positive³²
values of E for which (11) has solutions yielding well-behaved
R's are the continuum of all values lying above $mc^2$ and also the
discrete values

$$E_{nl} = mc^2\left[1 + \frac{\alpha^2}{\left(n - l - \tfrac12 + \sqrt{(l + \tfrac12)^2 - \alpha^2}\right)^2}\right]^{-1/2}, \tag{13}$$

where $\alpha$ is the fine-structure constant $e^2/c\hbar$, where the principal
quantum number n takes on the values

$$n = 1, 2, 3, \cdots, \tag{14}$$

and where, for a prescribed n, the azimuthal quantum number l
runs through the n values

$$l = 0, 1, 2, \cdots, (n - 1). \tag{15}$$

If the right side of (13) is expanded in powers of $\alpha$, and the terms
containing $1/c^4$, $1/c^6$, and so on, are dropped, (13) reduces to the
approximate formula

$$E_{nl} = E_n - \frac{\alpha^2 |E_n|}{4n^2}\left(\frac{4n}{l + \tfrac12} - 3\right) + mc^2, \tag{16}$$


³¹ Sommerfeld, page 112.

³² Equation (11) contains $E^2$ as well as $E$, and yields both positive
and negative energy levels; the latter are of no interest in the work of
this section.






where $E_n$ is the Bohr energy $18^{80}$. The second of the three terms
on the right side of (16) is called the relativity correction for
hydrogen.

Equation (13) is of substantially the same form as the Sommerfeld
fine-structure formula $3^{82}$; but, since Sommerfeld's k is an
integer, while our $l + \tfrac12$ is not, the two formulas differ in an
important detail. The levels given by (16) are contrasted with the
correct levels in Fig. 109. We conclude that, in quantum mechanics,
in contradistinction to the Bohr theory, the relativity correction
alone is not sufficient to bring agreement with experiment.
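The discrepancy is easy to tabulate. In units of $\alpha^2|E_n|/4n^2$, the relativity correction alone lowers a level by $4n/(l + \tfrac12) - 3$, while Sommerfeld's formula requires $4n/k - 3$ with integral k. The sketch below lists both sets of factors with exact fractions; the function names are hypothetical.

```python
from fractions import Fraction

def sommerfeld_shift(n, k):
    """Downward shift required by Sommerfeld's formula: 4n/k - 3."""
    return Fraction(4 * n, k) - 3

def relativity_shift(n, l):
    """Downward shift from the relativity correction alone: 4n/(l + 1/2) - 3."""
    return 4 * n / (l + Fraction(1, 2)) - 3

for n in range(1, 4):
    correct = [str(sommerfeld_shift(n, k)) for k in range(1, n + 1)]
    relativity_only = [str(relativity_shift(n, l)) for l in range(n)]
    print(f"n={n}  Sommerfeld: {correct}  relativity only: {relativity_only}")
```

For n = 1 the relativity correction alone gives a shift five times larger than the correct one, so the two formulas disagree even for the lowest level.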

We shall find in §89 that the approximate Sommerfeld formula $7^{82}$
can be obtained by combining the relativity correction with a
correction for the so-called electron spin; and in §96 we shall see how
the fine structure of the hydrogen levels can be found precisely with
the help of the correct relativistic Hamiltonian operator.

Fig. 109. Relativity corrections for hydrogen. The Sommerfeld levels
are shown at the ...


Exercises 


1. Extend Fig. 109 through n = 5.

relati-Wstk ^of'//n reduces to (12); remember that the 

J (mdoes wt ® nonrelativistio E 



Chapter XIII 

ELEMENTS OF PAULI'S THEORY OF ELECTRON SPIN


84. The Electron-Spin Hypothesis 

Our failure to get quantum mechanically a correct formula for
the hydrogen levels may be ascribed to the fact that we have so
far disregarded the phenomenon of electron spin. This phenomenon
has no classical analogue, and in approaching its theory we
shall begin by recounting an empirical clue.

Bohr’s theory does not lend itself to an accurate computation 
of the energy levels of atoms having several electrons; but it does 
enable one to infer quite a few important features of the levels, 
and during the decade following its inception a large amount of 
work was done in trying to apply it to the interpretation of 
spectral data. It was found, however, that a fairly coherent 
picture could be obtained only with the help of certain extraneous 
assumptions. 

In 1925, Uhlenbeck and Goudsmit¹ proposed that each electron
spins while revolving about a nucleus and has a quantized spin
angular momentum and, being an electrically charged body,² a spin
magnetic moment. This proposal was remarkably successful in
correlating spectral data for both unperturbed and perturbed
complex atoms, and in interpreting the results of the Stern-Gerlach
experiment concerned with the deflection of atoms projected
into an inhomogeneous magnetic field; but the reason why
Sommerfeld succeeded in getting a correct formula for hydrogen
without introducing electron spin became obscure.

The essence of the Uhlenbeck-Goudsmit hypothesis is that³

(a) each electron has a spin angular momentum $\mathbf{s}$ whose
component in any direction can have only the value $\tfrac12\hbar$ or the
value $-\tfrac12\hbar$, so that, in particular,

$$s_z = \pm\tfrac12\hbar, \tag{1}$$

and that

(b) each electron has a spin magnetic moment $\boldsymbol{\mu}_s$ such that, in
electromagnetic units,

$$\boldsymbol{\mu}_s = -\frac{e}{mc}\,\mathbf{s}. \tag{2}$$

Here $-e$ is the charge, m the mass, and $\mathbf{s}$ the spin angular
momentum of the electron, while c is the ratio of electrostatic and
electromagnetic units, that is, the speed of light.

¹ G. E. Uhlenbeck and S. Goudsmit, Naturwissenschaften, 13, 953 (1925);
Nature, 117, 264 (1926).

² S. J. Barnett, "Gyromagnetic and Electron-Inertia Effects," Rev.
Mod. Physics, 7, 129 (1935).

³ Our present s's are not to be confused with probability-current
densities.

From the standpoint of the Bohr theory, and also from the
standpoint of that part of quantum mechanics which we have
discussed so far, the assumption (a) seems strange for two main
reasons: the possible values of $s_z$ are taken to be halves rather
than integral multiples of the Bohr unit of angular momentum $\hbar$,
and only two values of $s_z$ are assumed possible. The assumption
(b) is also somewhat strange because the factor $e/mc$ in (2) is
larger than we might expect, that is, the magnetic moment of the
electron is assumed to be anomalously large when compared with
the assumed spin angular momentum of the electron. Incidentally,
according to classical theory, the magnetic moment of
an atom arising because of the orbital motion of an electron is

$$\boldsymbol{\mu}_L = -\frac{e}{2mc}\,\mathbf{L}, \tag{3}$$

where $\mathbf{L}$ is the orbital angular momentum of the electron; the
expression (3) is supported by spectroscopic evidence.

We shall see in the next chapter that the phenomenon of
electron spin finds a direct quantum-mechanical interpretation as
soon as we satisfy the requirements of the theory of relativity in
a thoroughgoing manner. But first, with the help of a method
due to Pauli,⁴ we shall show how the hypothesis of electron spin
can be grafted onto nonrelativistic quantum mechanics.

Both values of $s_z$ assumed in (1) vanish when $\hbar \to 0$, so that
electron spin disappears in the limit $\hbar \to 0$ and thus has no
classical analogue (the situation is of course different in the case of
orbital angular momentum because, since $m_l = 0, \pm1, \pm2, \cdots$,
a decrease of the factor $\hbar$ in $m_l\hbar$ does not delimit the value of
$m_l\hbar$). Hence there is no obvious way (in fact, there is not any
way) in which the operators that might be associated with $s_z$
can be constructed from the operators associated with our familiar
classical dynamical variables.

⁴ W. Pauli, Zeits. f. Physik, 43, 601 (1927). We shall not discuss the
relativistic form of Pauli's method.

In the applications of the electron-spin idea to the Bohr theory, 
the spin angular momentum is treated on quite the same footing 
as orbital angular momentum; and the success of these applica- 
tions would therefore suggest that at least the more fundamental 
properties of spin angular momentum must be similar to those 
of orbital angular momentum. Now, perhaps the most funda- 
mental property of classical orbital angular momentum is the set 
of Poisson* bracket relations 15^®, so that perhaps the most funda- 
mental property of the quantum-mechanical operators associated 
with the components of orbital angular momentum is the set of 
relations^ 


< 70 ’®) 


$$L_xL_y - L_yL_x = i\hbar L_z, \quad L_yL_z - L_zL_y = i\hbar L_x, \quad L_zL_x - L_xL_z = i\hbar L_y. \tag{4}$$

Taking the hint from (4), we now assume that the operators $s_x$,
$s_y$, and $s_z$ to be associated with the components of spin angular
momentum satisfy the commutation rules

$$s_xs_y - s_ys_x = i\hbar s_z, \quad s_ys_z - s_zs_y = i\hbar s_x, \quad s_zs_x - s_xs_z = i\hbar s_y. \tag{5}$$

We shall show in the next section that the eigenvalues required
by (1) are compatible with these commutation rules.


Exercise 

1. Recall Exercises 9^” and KFS and derive PJquations 12^, and 18^ 
from liquations 2*^ to 0^, In a similar way verify the terms containing 
dR/dr in le"" and 17"^. 


85. Possible Values of Components of Angular Momenta⁶

We shall now compute symbolically the eigenvalues of three
Hermitian operators, $J_x$, $J_y$, and $J_z$, defined through the
commutation rules

$$J_xJ_y - J_yJ_x = i\hbar J_z, \quad J_yJ_z - J_zJ_y = i\hbar J_x, \quad J_zJ_x - J_xJ_z = i\hbar J_y, \tag{1}$$


⁵ We no longer use angular brackets in equation numbers.

⁶ Condon and Shortley, page 46. The reader who omitted Chapter XI
may also omit §85.





that is, rules of the form $4^{84}$ and $5^{84}$; we use the symbols J to
emphasize that in this section we are not restricting ourselves to
orbital angular momenta denoted by L's or to spin angular
momenta denoted by s's, but are instead treating angular momenta
in general. We note that the operator

$$J^2 = J_x^2 + J_y^2 + J_z^2 \tag{2}$$

commutes with $J_x$, $J_y$, and $J_z$, and that

$$(J_x + iJ_y)J_z = (J_z - \hbar)(J_x + iJ_y). \tag{3}$$

Since $J_x$, $J_y$, and $J_z$ enter symmetrically in (1), the eigenvalues
of one of these operators are the same as those of another, and we
shall focus our attention on $J_z$. Further, we shall restrict ourselves
to $\psi$'s that are simultaneous eigen-$\psi$'s of $J_z$ and $J^2$; the
permissibility of this follows from the general expansion theorem 
quoted a few lines below 1^®. Each of our four J^s is Hermitian, 
and consequently the eigenvalues of each are all real. 

Let $\psi$ be a simultaneous eigen-$\psi$ of $J_z$ and $J^2$, belonging to the
eigenvalue a of $J_z$ and the eigenvalue $b^2$ of $J^2$, so that

$$J_z\psi = a\psi, \quad J^2\psi = b^2\psi, \quad \psi \neq 0. \tag{4a, b, c}$$

From (4a) we get

$$(J_x + iJ_y)J_z\psi = a(J_x + iJ_y)\psi, \tag{5}$$

or, according to (3), $(J_z - \hbar)(J_x + iJ_y)\psi = a(J_x + iJ_y)\psi$, that is,

$$J_z[(J_x + iJ_y)\psi] = (a + \hbar)[(J_x + iJ_y)\psi]. \tag{6}$$

Hence, if a is an eigenvalue of $J_z$, then $a + \hbar$ is also an eigenvalue
of $J_z$, provided the operand $(J_x + iJ_y)\psi$ does not vanish. Since

$$J^2[(J_x + iJ_y)\psi] = (J_x + iJ_y)J^2\psi = (J_x + iJ_y)b^2\psi = b^2[(J_x + iJ_y)\psi], \tag{7}$$

the operand $(J_x + iJ_y)\psi$, unless it vanishes, is an eigen-$\psi$ of $J^2$
belonging to the eigenvalue $b^2$ of $J^2$, the same eigenvalue that
belongs to $\psi$. Iterating this argument, we get the series

$$a,\ a + \hbar,\ a + 2\hbar,\ a + 3\hbar,\ \cdots \tag{8}$$

for the eigenvalues of $J_z$, and find that each of the corresponding
eigen-$\psi$'s of $J_z$ is an eigen-$\psi$ of $J^2$ belonging to the eigenvalue
$b^2$ of $J^2$.

Multiplying (4a) by $J_x - iJ_y$, rather than by $J_x + iJ_y$,
we find by analogous arguments that, if a is an eigenvalue of $J_z$,
then $a - \hbar$ is also an eigenvalue of $J_z$, so that the series (8)
becomes augmented to

$$\cdots,\ a - 2\hbar,\ a - \hbar,\ a,\ a + \hbar,\ a + 2\hbar,\ \cdots. \tag{9}$$

For any state, $\mathrm{av}\,J^2 = \mathrm{av}\,J_x^2 + \mathrm{av}\,J_y^2 + \mathrm{av}\,J_z^2$, and consequently
for a simultaneous eigenstate of $J_z$ and $J^2$ the square of the
eigenvalue of $J_z$ cannot be greater than the eigenvalue of $J^2$. Now,
the respective eigen-$\psi$'s of $J_z$ belonging to its eigenvalues (9)
are all eigen-$\psi$'s of $J^2$ belonging to the eigenvalue $b^2$ of $J^2$, and
therefore the series (9) must terminate on both sides, for otherwise
it would include terms whose squares are greater than $b^2$.

Let A be the greatest eigenvalue of $J_z$ consistent with the
eigenvalue $b^2$ of $J^2$, and let the corresponding simultaneous
eigen-$\psi$ of $J_z$ and $J^2$ be $\psi_A$. Substituting A for a in (6), we get
$J_z[(J_x + iJ_y)\psi_A] = (A + \hbar)[(J_x + iJ_y)\psi_A]$, so that, contrary to
hypothesis, $J_z$ would have the eigenvalue $A + \hbar$ unless

$$(J_x + iJ_y)\psi_A = 0. \tag{10}$$

Multiplying (10) by $J_x - iJ_y$ and rearranging the result, we get
$(J_x - iJ_y)(J_x + iJ_y)\psi_A = [J_x^2 + J_y^2 + i(J_xJ_y - J_yJ_x)]\psi_A
= (J_x^2 + J_y^2 - \hbar J_z)\psi_A = (J^2 - J_z^2 - \hbar J_z)\psi_A = (b^2 - A^2 - \hbar A)\psi_A
= 0$, so that, since $\psi_A \neq 0$,

$$b^2 - A^2 - \hbar A = 0. \tag{11}$$

The greater root of (11), which alone is of interest to us, is

$$A = -\tfrac12\hbar + \tfrac12\sqrt{\hbar^2 + 4b^2}. \tag{12}$$

A similar argument shows that the least eigenvalue of $J_z$ consistent
with the eigenvalue $b^2$ of $J^2$ is $\tfrac12\hbar - \tfrac12\sqrt{\hbar^2 + 4b^2}$, that is, $-A$.

The eigenvalues of $J_z$ consistent with a preassigned eigenvalue
$b^2$ of $J^2$ thus form the series

$$-A,\ -A + \hbar,\ -A + 2\hbar,\ \cdots,\ A - 2\hbar,\ A - \hbar,\ A, \tag{13}$$

and the difference between A and $-A$ must therefore be an
integral multiple of $\hbar$, so that $2A = m'\hbar$, where $m'$ is a positive
integer or zero, that is,

$$A = \tfrac12 m'\hbar, \qquad m' = 0, 1, 2, \cdots. \tag{14}$$





If $m'$ is even, say $m' = 2m$, then (13) takes the form

$$-m\hbar,\ -(m - 1)\hbar,\ \cdots,\ -\hbar,\ 0,\ \hbar,\ \cdots,\ (m - 1)\hbar,\ m\hbar, \tag{15}$$

where $m = 0, 1, 2, \cdots$; but if $m'$ is odd, say $m' = 2m + 1$, we get

$$-(m + \tfrac12)\hbar,\ -(m - \tfrac12)\hbar,\ \cdots,\ -\tfrac12\hbar,\ \tfrac12\hbar,\ \cdots,\ (m - \tfrac12)\hbar,\ (m + \tfrac12)\hbar, \tag{16}$$

where, as before, $m = 0, 1, 2, \cdots$.

The result (15), w^hich means that the eigenvalues of a com- 
ponent of an angular momentum may be integral multiples of ft, 
is in agreement with our old result 62^® obtained for orbital angular 
momenta. But the result (16), which means that the eigenvalues
of a component of an angular momentum may be half-odd integer
multiples of $\hbar$, is new; according to it, the assumption that the
possible values of a component of spin angular momentum are $\tfrac12\hbar$
and $-\tfrac12\hbar$ is not inconsistent with quantum mechanics, provided the
quantum-mechanical definition of angular momenta is taken to
be the commutation rules (1).

We summarize (15) and (16) by writing

$$J_z = m_j\hbar, \qquad m_j = 0, \pm\tfrac12, \pm1, \pm\tfrac32, \pm2, \cdots. \tag{17}$$
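The symbolic result can be illustrated with explicit matrices. The sketch below lists the series (13) for a given $m'$, evaluates $b^2 = A^2 + \hbar A$ from (11), and checks the first of the commutation rules (1) for matrices built in the representation where $J_z$ is diagonal. Units with $\hbar = 1$ are used; the ladder-operator matrix elements $\sqrt{j(j+1) - m(m+1)}$ are a standard result assumed here, not derived in this section, and all function names are hypothetical.

```python
import math
from fractions import Fraction

def jz_eigenvalues(m_prime):
    """Series (13) in units of hbar, with A = m'/2 as in (14)."""
    top = Fraction(m_prime, 2)
    return [-top + step for step in range(m_prime + 1)]

def j_squared_eigenvalue(m_prime):
    """b^2 in units of hbar^2, from (11): b^2 = A^2 + hbar*A."""
    top = Fraction(m_prime, 2)
    return top * top + top

def matmul(p, q):
    size = len(p)
    return [[sum(p[r][k] * q[k][c] for k in range(size)) for c in range(size)]
            for r in range(size)]

def angular_momentum_matrices(m_prime):
    """Jx, Jy, Jz with hbar = 1, in the basis where Jz is diagonal.

    ASSUMPTION: standard ladder-operator elements sqrt(j(j+1) - m(m+1)).
    """
    j = m_prime / 2
    ms = [j - i for i in range(m_prime + 1)]        # Jz eigenvalues, descending
    size = len(ms)
    raising = [[0j] * size for _ in range(size)]    # matrix of Jx + i*Jy
    for col in range(1, size):
        m = ms[col]
        raising[col - 1][col] = complex(math.sqrt(j * (j + 1) - m * (m + 1)))
    lowering = [[raising[c][r].conjugate() for c in range(size)] for r in range(size)]
    jx = [[(raising[r][c] + lowering[r][c]) / 2 for c in range(size)] for r in range(size)]
    jy = [[(raising[r][c] - lowering[r][c]) / 2j for c in range(size)] for r in range(size)]
    jz = [[complex(ms[r]) if r == c else 0j for c in range(size)] for r in range(size)]
    return jx, jy, jz

def commutator_defect(m_prime):
    """Largest element of Jx*Jy - Jy*Jx - i*Jz; zero if rule (1) holds."""
    jx, jy, jz = angular_momentum_matrices(m_prime)
    xy, yx = matmul(jx, jy), matmul(jy, jx)
    size = len(jz)
    return max(abs(xy[r][c] - yx[r][c] - 1j * jz[r][c])
               for r in range(size) for c in range(size))

for m_prime in range(5):
    print(m_prime, [str(v) for v in jz_eigenvalues(m_prime)],
          j_squared_eigenvalue(m_prime), commutator_defect(m_prime))
```

Odd values of $m'$ give the half-odd series (16), even values the integral series (15); in both cases the commutator defect vanishes to rounding error.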

Exercises 

1. Justify in detail the step from (8) to (9), and verify that the smallest
eigenvalue of $J_z$ consistent with the eigenvalue $b^2$ of $J^2$ is the negative
of (12).

2. Show that

$$J = \sqrt{j(j + 1)}\,\hbar, \qquad j = 0, \tfrac12, 1, \tfrac32, 2, \cdots \tag{18}$$

and note that (18) includes 69’'®. 

3. Correlate Equations 12^ to 15^ with the various intermediate results 
obtained symbolically in this section. 

86. The Pauli Operators and Operands

The spin operators are to satisfy the commutation rules

$$(5^{84})\qquad s_xs_y - s_ys_x = i\hbar s_z, \quad s_ys_z - s_zs_y = i\hbar s_x, \quad s_zs_x - s_xs_z = i\hbar s_y. \tag{1}$$

Further, according to the assumption (a) of §84, each of the
operators $s_x$, $s_y$, and $s_z$ must have just the two eigenvalues $\tfrac12\hbar$
and $-\tfrac12\hbar$; in view of $16^{85}$ these eigenvalues are not inconsistent
with (1).





We introduce three auxiliary operators, $\sigma_x$, $\sigma_y$, and $\sigma_z$, such
that

$$s_x = \tfrac12\hbar\sigma_x, \quad s_y = \tfrac12\hbar\sigma_y, \quad s_z = \tfrac12\hbar\sigma_z. \tag{2}$$

The following properties of the $\sigma$'s are of immediate interest to us.
Since the eigenvalues of each s are to be just $\tfrac12\hbar$ and $-\tfrac12\hbar$, the
eigenvalues of each $\sigma$ must be just 1 and $-1$. Each of the operators
$\sigma_x^2$, $\sigma_y^2$, and $\sigma_z^2$ must therefore have only the one eigenvalue 1;
and since the only operator having the lone eigenvalue 1 is the
unit operator we conclude that

$$\sigma_x^2 = \sigma_y^2 = \sigma_z^2 = 1. \tag{3}$$

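According to (1) the commutation rules satisfied by the $\sigma$'s
must be

$$\sigma_x\sigma_y - \sigma_y\sigma_x = 2i\sigma_z, \quad \sigma_y\sigma_z - \sigma_z\sigma_y = 2i\sigma_x, \quad \sigma_z\sigma_x - \sigma_x\sigma_z = 2i\sigma_y. \tag{4}$$

Now,

$$\begin{aligned}
2i(\sigma_x\sigma_y + \sigma_y\sigma_x) &= (2i\sigma_x)\sigma_y + \sigma_y(2i\sigma_x) \\
&= (\sigma_y\sigma_z - \sigma_z\sigma_y)\sigma_y + \sigma_y(\sigma_y\sigma_z - \sigma_z\sigma_y) \\
&= \sigma_y\sigma_z\sigma_y - \sigma_y\sigma_z\sigma_y - \sigma_z + \sigma_z \\
&= 0.
\end{aligned} \tag{5}$$

Hence $\sigma_x\sigma_y = -\sigma_y\sigma_x$, so that $\sigma_x$ and $\sigma_y$ anticommute; similar
arguments show that any two of the $\sigma$'s anticommute:

$$\sigma_x\sigma_y = -\sigma_y\sigma_x, \quad \sigma_y\sigma_z = -\sigma_z\sigma_y, \quad \sigma_z\sigma_x = -\sigma_x\sigma_z. \tag{6}$$

Finally, it follows from (4) and (6) that

$$\sigma_x\sigma_y = i\sigma_z, \quad \sigma_y\sigma_z = i\sigma_x, \quad \sigma_z\sigma_x = i\sigma_y. \tag{7a, b, c}$$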

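We now look for a set of explicit operators having the properties
of the $\sigma$'s. Since each $\sigma$ has just two eigenvalues, 2-by-2 matrices
may be expected to serve the purpose, and we begin by associating
with $\sigma_z$ the simplest 2-by-2 matrix having the eigenvalues 1
and $-1$:

$$\sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{8}$$

Now,

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} = \begin{pmatrix} a & -b \\ c & -d \end{pmatrix}, \tag{9}$$

$$\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}\begin{pmatrix} a & b \\ c & d \end{pmatrix} = \begin{pmatrix} a & b \\ -c & -d \end{pmatrix}, \tag{10}$$

so that every matrix that anticommutes with (8), as $\sigma_x$ and $\sigma_y$
do according to (6), must have the form

$$\begin{pmatrix} 0 & b \\ c & 0 \end{pmatrix}. \tag{11}$$

The eigenvalues of (11) are $\pm\sqrt{bc}$, so that if they are to be 1 and
$-1$ we must set $bc = 1$. Perhaps the simplest possibility is to
let $b = c = 1$, and to adopt for $\sigma_x$ the matrix
$\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$. It then follows from (7c) that the matrix to be
associated with $\sigma_y$ is $\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}$, and hence our complete
list of $\sigma$'s becomes

$$\sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad \sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{12}$$

The matrices (12) are called the Pauli spin matrices.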
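The properties required of the $\sigma$'s can be verified for the explicit matrices by direct multiplication. The sketch below is a minimal check written with nested lists and Python complex numbers; the helper names are hypothetical.

```python
def matmul(p, q):
    """Product of two 2-by-2 matrices given as nested lists."""
    return [[sum(p[r][k] * q[k][c] for k in range(2)) for c in range(2)]
            for r in range(2)]

def add(p, q):
    return [[p[r][c] + q[r][c] for c in range(2)] for r in range(2)]

def scale(factor, p):
    return [[factor * p[r][c] for c in range(2)] for r in range(2)]

SIGMA_X = [[0, 1], [1, 0]]
SIGMA_Y = [[0, -1j], [1j, 0]]
SIGMA_Z = [[1, 0], [0, -1]]
UNIT = [[1, 0], [0, 1]]
ZERO = [[0, 0], [0, 0]]

# Each square is the unit matrix.
squares_ok = all(matmul(s, s) == UNIT for s in (SIGMA_X, SIGMA_Y, SIGMA_Z))

# Any two different sigmas anticommute.
pairs = [(SIGMA_X, SIGMA_Y), (SIGMA_Y, SIGMA_Z), (SIGMA_Z, SIGMA_X)]
anticommute_ok = all(add(matmul(p, q), matmul(q, p)) == ZERO for p, q in pairs)

# Cyclic products: sigma_x sigma_y = i sigma_z, and so on.
cyclic_ok = (matmul(SIGMA_X, SIGMA_Y) == scale(1j, SIGMA_Z)
             and matmul(SIGMA_Y, SIGMA_Z) == scale(1j, SIGMA_X)
             and matmul(SIGMA_Z, SIGMA_X) == scale(1j, SIGMA_Y))

print(squares_ok, anticommute_ok, cyclic_ok)
```

Because every element is 0, ±1, or ±i, the arithmetic is exact and the comparisons need no tolerance.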

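Using (12) in (2), we get the following set of specific operators
suitable for associating with the components of spin angular
momentum:

$$s_x = \tfrac12\hbar\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad s_y = \tfrac12\hbar\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad s_z = \tfrac12\hbar\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{13}$$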


Two-component $\psi$'s. Since the Pauli theory uses the operators
(13), which are 2-by-2 matrices, the Pauli operands are
two-component column symbols; and, since the theory retains in
essence the Schroedinger operators for dynamical variables having
classical analogues, the components of these column symbols are
functions of x, y, and z (or of r, $\theta$, and $\phi$; and so forth). An
(instantaneous) Pauli operand $\psi$ thus has the form⁷

$$\psi = \begin{pmatrix} \psi_1(x, y, z) \\ \psi_2(x, y, z) \end{pmatrix} \tag{14}$$

or, as we shall write for brevity,

$$\psi = \begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}. \tag{15}$$


⁷ Two-component $\psi$-functions were first used, in treating spin, by C. G.
Darwin: Nature, 119, 282 (1927); Proc. Roy. Soc., 116A, 227 (1927). Pauli's
mathematical procedure is equivalent to Darwin's in spite of a difference
in the approach to the problem.





A Pauli $\psi$ is said to be well-behaved if at least one of its components
is different from zero and if each of its nonvanishing components
is well-behaved in the sense of the Schroedinger theory. The
Pauli $\bar\psi$ corresponding to (15) is the row symbol

$$\bar\psi = (\bar\psi_1, \bar\psi_2), \tag{16}$$

so that $\bar\psi\psi$ is the following ordinary function of x, y, and z:

$$\bar\psi\psi = (\bar\psi_1, \bar\psi_2)\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} = \bar\psi_1\psi_1 + \bar\psi_2\psi_2. \tag{17}$$

If

$$\int \bar\psi\psi\ đ\tau = 1, \tag{18}$$

that is, if $\int (\bar\psi_1\psi_1 + \bar\psi_2\psi_2)\ đ\tau = 1$, then the Pauli $\psi$ is said to be
normalized; as indicated by the crossed d, the integral in (18)
extends over the entire three-dimensional space.

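Further Pauli operators. In order that a differential⁸ operator
$\xi$ may combine with the spin matrices or operate on two-component
$\psi$'s, it is first thrown into a matrix form by the following
rule:

$$\xi \to \begin{pmatrix} \xi & 0 \\ 0 & \xi \end{pmatrix}. \tag{19}$$

For example, the Pauli operators associated with x, $p_x$, and $L_z$ are

$$\begin{pmatrix} x & 0 \\ 0 & x \end{pmatrix}, \quad \begin{pmatrix} p_x & 0 \\ 0 & p_x \end{pmatrix}, \quad \begin{pmatrix} L_z & 0 \\ 0 & L_z \end{pmatrix}, \tag{20}$$

where the matrix elements are the appropriate Schroedinger
operators. Since the Schroedinger operators associated with x,
$p_x$, and $L_z$ are x, $-i\hbar\,\partial/\partial x$, and $-i\hbar\,\partial/\partial\phi$, the explicit forms of
the Pauli operators (20) are

$$\begin{pmatrix} x & 0 \\ 0 & x \end{pmatrix}, \quad \begin{pmatrix} -i\hbar\,\partial/\partial x & 0 \\ 0 & -i\hbar\,\partial/\partial x \end{pmatrix}, \quad \begin{pmatrix} -i\hbar\,\partial/\partial\phi & 0 \\ 0 & -i\hbar\,\partial/\partial\phi \end{pmatrix}. \tag{21}$$

⁸ As before, the term differential operator denotes an operator
constructed from one or more of the operators x, $\partial/\partial x$, $\partial/\partial y$, and so on.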




The various Pauli operators are added together by the standard 
rules of matrix algebra; for example, 



The Pauli operators are multiplied together by the standard 
rules of matrix algebra; for example, 



But when multiplying together matrices with noncommuting 
elements we must preserve both the order of the matrix factors 
and the order of the element factors. For example, 



The Pauli operators operate on the two-component yp’s ac- 
cording to the standard rule 51^. For example, 

"©"C o*)©=(i;r) ® 

-0 ©=(-©■ <“> 





Similarly, 




( 29 ) 


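The rule remains formally the same when the elements of a matrix
are differential operators; for example,

$$p_x\psi = \begin{pmatrix} p_x & 0 \\ 0 & p_x \end{pmatrix}\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} = \begin{pmatrix} p_x\psi_1 \\ p_x\psi_2 \end{pmatrix} = \begin{pmatrix} -i\hbar\,\partial\psi_1/\partial x \\ -i\hbar\,\partial\psi_2/\partial x \end{pmatrix}. \tag{30}$$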


A Pauli operator can usually be written in various forms having 
different degrees of explicitness. For example, in the case of the 
x-component of linear momentum, we have the alternative forms 



and so on, any one of which can be substituted for another as
occasion demands.

Eigenvalues and eigen-$\psi$'s are defined in Pauli's theory in our
standard way: if an operator $\alpha$, a well-behaved operand $\psi$, and
a number $\lambda$ satisfy the equation

$$\alpha\psi = \lambda\psi, \tag{32}$$

then $\psi$ is said to be an eigen-$\psi$ of $\alpha$, $\lambda$ is said to be an eigenvalue
of $\alpha$, and the eigen-$\psi$ $\psi$ and the eigenvalue $\lambda$ of $\alpha$ are said to belong
to each other.

If $\alpha$ is a matrix with numerical elements, then (32) is an ordinary
matrix eigenvalue equation, and the eigenvalues of $\alpha$ are computed
by the methods of §56, that is, by means of the secular equation.
The eigen-$\psi$'s are then also found by the standard matrix methods,
except that $\psi_1$ and $\psi_2$ are now allowed to be well-behaved functions
of x, y, and z.

If $\alpha$ is a differential operator, then, according to (19), Equation
(32) takes the form

$$\begin{pmatrix} \alpha & 0 \\ 0 & \alpha \end{pmatrix}\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} = \lambda\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} \tag{33}$$

and splits up into the following pair of simultaneous equations

$$\alpha\psi_1 = \lambda\psi_1, \qquad \alpha\psi_2 = \lambda\psi_2, \tag{34}$$

which does not have dependent variables in common, and each
member of which is just the Schroedinger eigenvalue equation for
the operator $\alpha$. It follows, in particular, that the Pauli theory
and the Schroedinger theory agree as to the possible values of
such dynamical variables as linear momenta and orbital angular
momenta.

If $\alpha$ involves both matrices and differential operators, then (32)
again splits up into a pair of (usually differential) equations; if
the matrices involved in $\alpha$ are not diagonal, then $\psi_1$ and $\psi_2$ become
interrelated, and the problem of solving the equations for $\psi_1$ and
$\psi_2$ may become complicated.

The handling of the eigenvalue equation will be illustrated in 
the Exercises and in §87. 

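Concluding remarks. Every Pauli $\psi$ pertaining to a system
whose Hamiltonian is H is required to depend on the time t in
such a way as to satisfy the equation

$$H\psi = i\hbar\,\frac{\partial\psi}{\partial t}, \tag{35}$$

where H is the Pauli operator associated with the Hamiltonian, and

$$\frac{\partial\psi}{\partial t} = \begin{pmatrix} \partial\psi_1/\partial t \\ \partial\psi_2/\partial t \end{pmatrix}. \tag{36}$$

Equation (35) has the form of the second Schroedinger equation.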

The expected average of a dynamical variable $\alpha$ for a state $\psi$
is taken in Pauli's theory to be

$$\mathrm{av}_\psi\,\alpha = \frac{\int \bar\psi\alpha\psi\ đ\tau}{\int \bar\psi\psi\ đ\tau}, \tag{37}$$

where $\alpha$ in the numerator is the Pauli operator associated with the
dynamical variable $\alpha$ and where, as indicated, the integrals extend
over all space. Note that the Pauli expectation formula (37) is 
formally identical with the Schroedinger expectation formula 8^. 
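The expectation formula is simplest to try out for spin variables. The sketch below assumes a state of the special form $\psi = f(x, y, z)\,(c_1, c_2)$ with constant $c_1$ and $c_2$, so that the integral of $\bar f f$ cancels between numerator and denominator of (37) and only the two constants matter; the function name and this restriction are assumptions made for illustration.

```python
SIGMA_X = [[0, 1], [1, 0]]
SIGMA_Y = [[0, -1j], [1j, 0]]
SIGMA_Z = [[1, 0], [0, -1]]

def spin_average(sigma, components):
    """Expected average of a spin matrix for psi = f(x, y, z) * (c1, c2).

    ASSUMPTION: c1 and c2 are constants, so the spatial integrals cancel.
    """
    c1, c2 = (complex(c) for c in components)
    upper = sigma[0][0] * c1 + sigma[0][1] * c2   # first component of sigma*psi
    lower = sigma[1][0] * c1 + sigma[1][1] * c2   # second component of sigma*psi
    numerator = c1.conjugate() * upper + c2.conjugate() * lower
    denominator = abs(c1) ** 2 + abs(c2) ** 2
    return (numerator / denominator).real

up = (1, 0)  # state in which sigma_z is certainly 1
print(spin_average(SIGMA_Z, up), spin_average(SIGMA_X, up), spin_average(SIGMA_Y, up))
```

Multiplying the results by $\tfrac12\hbar$ gives the averages of the corresponding components of spin angular momentum.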


Exercises 

1. Show that the rules given in the text for handling Pauli operators 
are consistent with the definitions of operator sum and operator product 
adopted in §2. 





2. Verify that the Pauli operators x and satisfy the equation 

/ fCt \ 

3. Show that, if = ( * )i then av^ Lz ~ 0, av^ LJ = 

av^ LI *= 0, and so on. What is the distribution-in-L, for this state? 

4. Let $\psi(\sigma_i^+)$ denote the eigen-$\psi$'s of the spin matrix $\sigma_i$ belonging to
the eigenvalue 1 of $\sigma_i$, and $\psi(\sigma_i^-)$ denote the eigen-$\psi$'s of $\sigma_i$ belonging to
the eigenvalue $-1$ of $\sigma_i$. Show that then



where each f and g is an arbitrary well-behaved function of x, y, and z.
Note that $\psi(\sigma_x^+)$ is an eigen-$\psi$ of $s_x$ belonging to the eigenvalue $\tfrac12\hbar$ of $s_x$,
and so forth.

5. Show that, for a state for which the value of any one of the three
variables $s_x$, $s_y$, and $s_z$ is a certainty, the values of each of the other two
s's are equally likely to be $\tfrac12\hbar$ or $-\tfrac12\hbar$.

6. Adopt $s_x\cos\alpha + s_y\cos\beta + s_z\cos\gamma$ (that is, the expression suggested
by classical analogy) for the operator associated with the component of
spin angular momentum $\mathbf{s}$ in the direction making the angles $\alpha$, $\beta$, and $\gamma$
with the x-, y-, and z-axis, and show that the possible values of a
component of $\mathbf{s}$, quite independently of the direction in which the component
may be taken, are $\pm\tfrac12\hbar$.

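7. Let $\mathbf{J}$ denote the total angular momentum of the electron (Fig. 110),
so that

$$\mathbf{J} = \mathbf{L} + \mathbf{s},$$

$$J_x = L_x + s_x, \quad J_y = L_y + s_y, \quad J_z = L_z + s_z,$$

and

$$J^2 = J_x^2 + J_y^2 + J_z^2 = (L_x + s_x)^2 + (L_y + s_y)^2 + (L_z + s_z)^2. \tag{40}$$

Show that the Pauli operators associated with $J_z$ and $J^2$ are

$$(22)\qquad \begin{pmatrix} L_z + \tfrac12\hbar & 0 \\ 0 & L_z - \tfrac12\hbar \end{pmatrix} \tag{41}$$

and

$$\begin{pmatrix} L^2 + \hbar L_z + \tfrac34\hbar^2 & \hbar(L_x - iL_y) \\ \hbar(L_x + iL_y) & L^2 - \hbar L_z + \tfrac34\hbar^2 \end{pmatrix}, \tag{42}$$

where the L's are the appropriate Schroedinger operators.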

8. Show that the operators $J_x$, $J_y$, and $J_z$ satisfy the commutation
rules $1^{85}$.

9. Show that the Pauli operators $J_z$, $J^2$, and $L^2$ commute with each
other. In these computations, the forms $J_z = L_z + \tfrac12\hbar\sigma_z$ and
$J^2 = L^2 + \hbar(L_x\sigma_x + L_y\sigma_y + L_z\sigma_z) + \tfrac34\hbar^2$ are more convenient than the
more explicit forms (41) and (42).





87. The Simultaneous Eigen-$\psi$'s of $J_z$, $J^2$, and $L^2$

Let us now consider briefly an electron that possesses a spin 
angular momentum and a spin magnetic moment and that moves 
classically in a central electrostatic field. The orbital angular 
momentum L of the electron, its spin angular momentum 5, and 
its total angular momentum J are shown diagramatically in 

Since the central electrostatic field exerts no 
torque on our electron, its total angular momen- 
tum is conserved: 

(a) the total angular momentum J is a con- 
stant of motion; that is, Jx , Jy , Jz i and J are 
each a constant of motion. 

Now, the orbital motion of the electron, which 
may be likened to the flow of an electric current, 
produces an orbital magnetic moment, whose 
value is given by This magnetic moment 
exerts a torque on the spin magnetic moment and 
thus tends to alter the direction, though not the 
magnitude, of the latter, that is, it causes the 
latter to precess; and, since the spin angular 
momentum and the spin magnetic moment are 
proportional to one another, we conclude that 

(b) the spin angular momentum $\mathbf{s}$ undergoes a precession and is
therefore not a constant of motion; but its magnitude $s$ is a
constant of motion.

Considering the torque exerted by the spin magnetic moment
on the orbital magnetic moment, we find in a similar way that

(c) the angular momentum $\mathbf{L}$ undergoes a precession and is
therefore not a constant of motion; but its magnitude $L$ is a
constant of motion.

It finally follows from (a), (b), and (c) that

(d) the vectors $\mathbf{L}$ and $\mathbf{s}$ precess about the vector $\mathbf{J}$.

These precessions are indicated in Fig. 110 by the curved arrows.

Fig. 110. Classical vector diagram for orbital, spin, and total angular
momenta.

We shall see in the next section that in the Pauli theory of an
electron in a central electrostatic field the dynamical variables
$J_x$, $J_y$, $J_z$, $J$, $s$, and $L$ are constants of motion, just as the
classical arguments indicated above would lead us to expect. In
particular, the dynamical variables

$$J_z, \qquad L^2, \qquad J^2, \tag{1}$$



which were shown in Exercise $9^{86}$ to commute with one another,
are a set of commuting constants of motion. We prefer the set (1)
to such sets as $J_x$, $L^2$, and $J^2$ or $J_z$, $L$, and $J$, because of the
relative convenience of the operators involved. We omit $s$ and $s^2$
from the set because these constants of motion are merely numerical
constants and therefore are not helpful in computations; indeed,
$s_x^2 + s_y^2 + s_z^2 = \tfrac14\hbar^2(\sigma_x^2 + \sigma_y^2 + \sigma_z^2) = \tfrac34\hbar^2$ and $s = \tfrac12\sqrt{3}\,\hbar$.
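The statement that $s^2$ is merely a numerical constant can be checked directly from the Pauli matrices. The following is a minimal sketch, assuming the standard 2-by-2 forms of $\sigma_x$, $\sigma_y$, $\sigma_z$ and units in which $\hbar = 1$; the helper names `matmul` and `spin_squared` are ours and do not appear in the text.

```python
# Pauli matrices as nested lists (standard forms; hbar = 1 is an assumption).
SIGMA_X = [[0, 1], [1, 0]]
SIGMA_Y = [[0, -1j], [1j, 0]]
SIGMA_Z = [[1, 0], [0, -1]]


def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def spin_squared(hbar=1.0):
    """Return s_x^2 + s_y^2 + s_z^2 with s = (hbar / 2) * sigma."""
    total = [[0, 0], [0, 0]]
    for sigma in (SIGMA_X, SIGMA_Y, SIGMA_Z):
        square = matmul(sigma, sigma)
        for i in range(2):
            for j in range(2):
                total[i][j] += (hbar / 2) ** 2 * square[i][j]
    return total


if __name__ == "__main__":
    # Expect (3/4) hbar^2 on the diagonal and zeros elsewhere.
    print(spin_squared())
```

Because the result is a multiple of the unit matrix, every two-component $\psi$ is an eigen-$\psi$ of $s^2$, which is why including $s^2$ in the set (1) would add nothing.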
In computing the eigen-$\psi$'s of a Hamiltonian $H$, we may,
according to the expansion theorem, restrict ourselves to those
eigen-$\psi$'s of $H$ which are simultaneously eigen-$\psi$'s of a set of
commuting constants of motion. In particular, in treating the
central motion of an electron by the Pauli method we may restrict
ourselves to those eigen-$\psi$'s of $H$ which are simultaneously
eigen-$\psi$'s of $J_z$, $L^2$, and $J^2$. Therefore, in preparation for the problem
of central motion, we shall now compute the simultaneous
eigen-$\psi$'s of $J_z$, $L^2$, and $J^2$.

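The eigen-$\psi$'s of $J_z$. We denote the eigenvalues of $J_z$ by
$m\hbar$, where $m$ is as yet undetermined.⁹ In view of $41^{86}$, the
eigenvalue equation

$$J_z\psi = m\hbar\psi \tag{2}$$

takes the form

$$\begin{pmatrix} L_z + \tfrac12\hbar & 0 \\ 0 & L_z - \tfrac12\hbar \end{pmatrix}
\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}
= m\hbar \begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} \tag{3}$$

and splits up into the two equations

$$(L_z + \tfrac12\hbar)\psi_1 = m\hbar\psi_1, \qquad (L_z - \tfrac12\hbar)\psi_2 = m\hbar\psi_2, \tag{4}$$

that is,

$$-i\frac{\partial\psi_1}{\partial\varphi} = (m - \tfrac12)\psi_1, \qquad -i\frac{\partial\psi_2}{\partial\varphi} = (m + \tfrac12)\psi_2. \tag{5}$$

The respective general solutions of the differential equations
(5) are

$$\psi_1 = A_1(r, \theta)e^{i(m-\frac12)\varphi}, \qquad \psi_2 = A_2(r, \theta)e^{i(m+\frac12)\varphi}, \tag{6}$$

⁹ This $m$ must not be confused with the mass of the electron. A more
explicit symbol for the eigenvalues of $J_z$ is $m_j\hbar$.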





and we note that to assure single-valuedness of $\psi_1$ and $\psi_2$ we must
restrict $m$ to the values $\pm\tfrac12, \pm\tfrac32, \ldots$, and hence

$$J_z = m\hbar, \qquad m = \pm\tfrac12, \pm\tfrac32, \ldots. \tag{7}$$


The case of a single spinning electron thus corresponds to the 
possibility 16“ rather than 15®*. 
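The single-valuedness argument can be illustrated numerically: the angle factors $e^{i(m \mp \frac12)\varphi}$ of (6) must return to their starting values when $\varphi$ increases by $2\pi$. The sketch below assumes only this periodicity requirement; the function name `is_single_valued` is hypothetical.

```python
import cmath
from fractions import Fraction


def is_single_valued(m, tol=1e-12):
    """True if exp(i(m - 1/2)phi) and exp(i(m + 1/2)phi) both return to
    their starting value when phi increases by 2*pi."""
    for k in (m - Fraction(1, 2), m + Fraction(1, 2)):
        if abs(cmath.exp(2j * cmath.pi * float(k)) - 1) > tol:
            return False
    return True


if __name__ == "__main__":
    for m in (Fraction(-3, 2), Fraction(-1, 2), 0, Fraction(1, 2), 1):
        print(m, is_single_valued(m))
```

Only the half-odd values of $m$ pass the test, in agreement with (7).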

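The eigen-$\psi$'s of $J_z$ belonging to the eigenvalue $m\hbar$ of $J_z$,
which we denote by $\psi(m)$, are, accordingly,

$$\psi(m) = \begin{pmatrix} A_1(r, \theta)e^{i(m-\frac12)\varphi} \\ A_2(r, \theta)e^{i(m+\frac12)\varphi} \end{pmatrix}, \tag{8}$$

where the values of $m$ are those given in (7) and where $A_1(r, \theta)$
and $A_2(r, \theta)$ are arbitrary well-behaved functions of $r$ and $\theta$,
except that at most one of them can be taken as zero.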

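The simultaneous eigen-$\psi$'s of $J_z$ and $L^2$. Since the eigenvalues
of $L^2$ are known to be $l(l + 1)\hbar^2$, with $l = 0, 1, 2, \ldots$,
we write the eigenvalue equation for $L^2$ from the outset as

$$L^2\psi = l(l + 1)\hbar^2\psi. \tag{9}$$

This equation splits up into the two Schroedinger eigenvalue
equations

$$L^2\psi_1 = l(l + 1)\hbar^2\psi_1, \qquad L^2\psi_2 = l(l + 1)\hbar^2\psi_2, \tag{10}$$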

and it follows from 67^* that

$$\psi_1 = R_1\Upsilon(l, m'), \qquad \psi_2 = R_2\Upsilon(l, m''), \tag{11}$$

or, more explicitly (we absorb the factor $(2\pi)^{-\frac12}$ of the $\Upsilon$'s in
the $R$'s),

$$\psi_1 = R_1\Theta(l, m')e^{im'\varphi}, \qquad \psi_2 = R_2\Theta(l, m'')e^{im''\varphi}, \tag{12}$$

where $R_1$ and $R_2$ are arbitrary well-behaved functions of $r$, and
$m'$ and $m''$ are integers, zero included, such that $|m'| \le l$ and
$|m''| \le l$.

Comparison of (11) with (8) now shows that to get a $\psi$ [we
denote such a $\psi$ by $\psi(lm)$] that is simultaneously an eigen-$\psi$ of
$J_z$ belonging to the eigenvalue $m\hbar$ of $J_z$ and an eigen-$\psi$ of $L^2$
belonging to the eigenvalue $l(l + 1)\hbar^2$ of $L^2$ we must set
$m' = m - \tfrac12$ and $m'' = m + \tfrac12$, so that

$$\psi(lm) = \begin{pmatrix} R_1\Upsilon(l, m - \tfrac12) \\ R_2\Upsilon(l, m + \tfrac12) \end{pmatrix}. \tag{13}$$

The well-behaved radial factors $R_1$ and $R_2$ remain undetermined
and unrelated, except that at most one of them can be taken
as zero.

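The simultaneous eigenfunctions of $J_z$, $L^2$, and $J^2$. We finally
proceed to adjust (13) so as to make it an eigenfunction of $J^2$;
that is, writing the eigenvalues of $J^2$ in the form $j(j + 1)\hbar^2$,
with $j$ as yet undetermined, we proceed to force (13) to satisfy
the eigenvalue equation

$$J^2\begin{pmatrix} R_1\Upsilon_l^{\alpha} \\ R_2\Upsilon_l^{\beta} \end{pmatrix}
= j(j + 1)\hbar^2\begin{pmatrix} R_1\Upsilon_l^{\alpha} \\ R_2\Upsilon_l^{\beta} \end{pmatrix}, \tag{14}$$

where we have used the temporary abbreviations

$$\alpha = m - \tfrac12, \qquad \beta = m + \tfrac12. \tag{15}$$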

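Using $J^2$ in the form $42^{86}$, we find that (14) splits up into the
two equations¹⁰

$$\begin{aligned}
(L^2 + \hbar L_z + \tfrac34\hbar^2)R_1\Upsilon_l^{\alpha} + \hbar(L_x - iL_y)R_2\Upsilon_l^{\beta} &= j(j + 1)\hbar^2R_1\Upsilon_l^{\alpha} \\
\hbar(L_x + iL_y)R_1\Upsilon_l^{\alpha} + (L^2 - \hbar L_z + \tfrac34\hbar^2)R_2\Upsilon_l^{\beta} &= j(j + 1)\hbar^2R_2\Upsilon_l^{\beta},
\end{aligned} \tag{16}$$

which, in view of Equations 12*^ to 16'*’, reduce to

$$\begin{aligned}
[l(l + 1) + \alpha + \tfrac34]R_1\Upsilon_l^{\alpha} + H_l^{\beta}R_2\Upsilon_l^{\alpha} &= j(j + 1)R_1\Upsilon_l^{\alpha} \\
G_l^{\alpha}R_1\Upsilon_l^{\beta} + [l(l + 1) - \beta + \tfrac34]R_2\Upsilon_l^{\beta} &= j(j + 1)R_2\Upsilon_l^{\beta},
\end{aligned} \tag{17}$$

where, according to 10^,

$$H_l^{\beta} = G_l^{\alpha} = \sqrt{(l + \tfrac12)^2 - m^2}. \tag{18}$$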

The angle factors in each of the equations (17) cancel out, and
we get

$$\begin{aligned}
[l(l + 1) - j(j + 1) + m + \tfrac14]R_1 + \sqrt{(l + \tfrac12)^2 - m^2}\,R_2 &= 0 \\
\sqrt{(l + \tfrac12)^2 - m^2}\,R_1 + [l(l + 1) - j(j + 1) - m + \tfrac14]R_2 &= 0.
\end{aligned} \tag{19}$$

¹⁰ We restrict ourselves to the case when both $\Upsilon$'s in (13) are different
from zero, but write our final results in completed form; see Exercises
1 and 3.




Thus in an eigen-$\psi$ of $J^2$ the radial factors $R_1$ and $R_2$ are no longer
unrelated.

Equations (19) can be made consistent by a proper choice of
the quantum number $j$, called the inner quantum number, which
so far has been undetermined. Inspection of (19) shows that the
condition for the vanishing of the determinant of the coefficients is

$$[l(l + 1) - j(j + 1) + \tfrac14]^2 = (l + \tfrac12)^2, \tag{20}$$

that is,

$$l(l + 1) - j(j + 1) + \tfrac14 = \pm(l + \tfrac12). \tag{21}$$

Taking the lower sign, we get the possibility

$$j(j + 1) = l(l + 1) + (l + \tfrac12) + \tfrac14 = (l + \tfrac12)(l + \tfrac32). \tag{22}$$

One root of this equation is

$$j = l + \tfrac12, \qquad l = 0, 1, 2, \ldots, \tag{23}$$

and the second root is $-l - \tfrac32$; but, since $j$ appears in (19) only
in the combination $j(j + 1)$, the second root adds nothing new.
Similarly, the upper sign in (21) yields

$$j(j + 1) = l(l + 1) - (l + \tfrac12) + \tfrac14 = (l - \tfrac12)(l + \tfrac12), \tag{24}$$

so that

$$j = l - \tfrac12, \qquad l = 1, 2, 3, \ldots, \tag{25}$$

while the second root of (24) adds nothing new. The inclusion
of $l = 0$ in (23) and its omission in (25) are justified by a special
consideration of the case $l = 0$, to be taken up in Exercise 3.

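Substituting (23) into (19), we find that
$R_1/R_2 = \sqrt{l + \tfrac12 + m}\big/\sqrt{l + \tfrac12 - m}$, that is,

$$R_1 = \sqrt{l + \tfrac12 + m}\,R, \qquad R_2 = \sqrt{l + \tfrac12 - m}\,R, \tag{26}$$

where $R$ is an arbitrary (well-behaved) function of $r$. The result
of substituting (26) into (13) will be denoted by $\psi(l, j = l + \tfrac12, m)$:

$$\psi(l, j = l + \tfrac12, m) = \begin{pmatrix} \sqrt{l + \tfrac12 + m}\,R\,\Upsilon(l, m - \tfrac12) \\ \sqrt{l + \tfrac12 - m}\,R\,\Upsilon(l, m + \tfrac12) \end{pmatrix}, \qquad l = 0, 1, 2, \ldots. \tag{27}$$

The expression (27) is one form of the simultaneous eigen-$\psi$'s
of $J_z$, $L^2$, and $J^2$.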





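To get the remaining simultaneous eigen-$\psi$'s of $J_z$, $L^2$, and $J^2$,
which we denote by $\psi(l, j = l - \tfrac12, m)$, we substitute (25)
into (19) and find that

$$R_1 = \sqrt{l + \tfrac12 - m}\,R, \qquad R_2 = -\sqrt{l + \tfrac12 + m}\,R, \tag{28}$$

and that hence

$$\psi(l, j = l - \tfrac12, m) = \begin{pmatrix} \sqrt{l + \tfrac12 - m}\,R\,\Upsilon(l, m - \tfrac12) \\ -\sqrt{l + \tfrac12 + m}\,R\,\Upsilon(l, m + \tfrac12) \end{pmatrix}, \qquad l = 1, 2, 3, \ldots, \tag{29}$$

where the radial factor $R$ need not be related to that in (27).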
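The algebra leading from (19) to (26) and (28) can be checked numerically. The following is a minimal sketch that substitutes the two radial ratios back into the left sides of (19), taking the common factor $R$ equal to 1; the function names are ours, not the text's.

```python
import math


def coefficient_matrix(l, m, j):
    """Coefficients of (R1, R2) in the pair of equations (19)."""
    c = math.sqrt((l + 0.5) ** 2 - m ** 2)
    d = l * (l + 1) - j * (j + 1)
    return [[d + m + 0.25, c],
            [c, d - m + 0.25]]


def radial_factors(l, m, j):
    """(R1, R2) with the common factor R set to 1, as in (26) and (28)."""
    if j == l + 0.5:
        return (math.sqrt(l + 0.5 + m), math.sqrt(l + 0.5 - m))
    if j == l - 0.5:
        return (math.sqrt(l + 0.5 - m), -math.sqrt(l + 0.5 + m))
    raise ValueError("j must equal l + 1/2 or l - 1/2")


def residual(l, m, j):
    """Largest absolute value of the two left sides of (19)."""
    (a, b), (c, d) = coefficient_matrix(l, m, j)
    r1, r2 = radial_factors(l, m, j)
    return max(abs(a * r1 + b * r2), abs(c * r1 + d * r2))


if __name__ == "__main__":
    # l = 2, m = 1/2: both values of j should give a vanishing residual.
    print(residual(2, 0.5, 2.5), residual(2, 0.5, 1.5))
```

For any other value of $j$ the determinant of the coefficients does not vanish, and (19) then admits only the trivial solution $R_1 = R_2 = 0$.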


Exercises 


1. Set $m = l + \tfrac12$ in (13) and show that the corresponding simultaneous
eigen-$\psi$ of $J_z$, $L^2$, and $J^2$ is included in (27); show that (29) does not
provide for the case $m = l + \tfrac12$ and explain why this is to be expected on
intuitive physical grounds. Consider in a similar way the case $m = -l - \tfrac12$.

2. Show that, if $m = \pm(l + \tfrac12)$, then the simultaneous eigen-$\psi$'s of
$J_z$, $L^2$, and $J^2$ are also eigen-$\psi$'s of $L_z$ and $s_z$, and explain why this
is to be expected on intuitive physical grounds.

3. Use the result of Exercise 1 to justify the inclusion of $l = 0$ in (27)
and its omission in (29).

4. Verify that for a state of the form (27) we have:

$J_z$: certainly $m\hbar$
$L^2$: certainly $l(l + 1)\hbar^2$
$J^2$: certainly $j(j + 1)\hbar^2 = (l + \tfrac12)(l + \tfrac32)\hbar^2$
$L_z$: av $L_z = \dfrac{2lm\hbar}{2l + 1}$; possible values $(m - \tfrac12)\hbar$ and $(m + \tfrac12)\hbar$, with relative probabilities $(l + \tfrac12 + m)$ and $(l + \tfrac12 - m)$
$s_z$: av $s_z = \dfrac{m\hbar}{2l + 1}$; possible values $\tfrac12\hbar$ and $-\tfrac12\hbar$, with relative probabilities $(l + \tfrac12 + m)$ and $(l + \tfrac12 - m)$.

5. Consider states of the form (29) along the lines of Exercise 4 and
show that the differences between the new and the former results are
consistent with the fact that now $J^2 < L^2$, while formerly $J^2 > L^2$.

6. Show that (27) and (29) are eigen-$\psi$'s of the operator

$$L_xs_x + L_ys_y + L_zs_z,$$

which is usually denoted by $\mathbf{L}\cdot\mathbf{s}$, and that in fact

$$\mathbf{L}\cdot\mathbf{s}\,\psi(l, j = l + \tfrac12, m) = \tfrac12 l\hbar^2\,\psi(l, j = l + \tfrac12, m) \tag{30}$$

and

$$\mathbf{L}\cdot\mathbf{s}\,\psi(l, j = l - \tfrac12, m) = -\tfrac12(l + 1)\hbar^2\,\psi(l, j = l - \tfrac12, m). \tag{31}$$
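The eigenvalues quoted in (30) and (31), and the averages of Exercise 4, can be cross-checked with exact rational arithmetic. The sketch below assumes the identity $2\mathbf{L}\cdot\mathbf{s} = J^2 - L^2 - s^2$ with $s^2 = \tfrac34\hbar^2$, and takes the relative probabilities of Exercise 4 as given; the function names are hypothetical.

```python
from fractions import Fraction

HALF = Fraction(1, 2)


def ls_eigenvalue(l, j):
    """Eigenvalue of L.s in units of hbar^2, from 2 L.s = J^2 - L^2 - s^2."""
    return (j * (j + 1) - l * (l + 1) - Fraction(3, 4)) / 2


def averages_for_state_27(l, m):
    """Average L_z and s_z (units of hbar) for a state of the form (27),
    using the relative probabilities (l + 1/2 + m) : (l + 1/2 - m)."""
    w_up = l + HALF + m      # spin up,   L_z = (m - 1/2) hbar
    w_down = l + HALF - m    # spin down, L_z = (m + 1/2) hbar
    total = w_up + w_down
    av_lz = ((m - HALF) * w_up + (m + HALF) * w_down) / total
    av_sz = (HALF * w_up - HALF * w_down) / total
    return av_lz, av_sz


if __name__ == "__main__":
    l = 2
    print(ls_eigenvalue(l, l + HALF), ls_eigenvalue(l, l - HALF))
    print(averages_for_state_27(l, Fraction(3, 2)))
```

Note that the two averages always add up to $m$, as they must, since $J_z = L_z + s_z$ has the certain value $m\hbar$.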





88. Spin-Orbit Interaction and the Spin Correction 

We now proceed to set up the Pauli Hamiltonian for an electron
in a central electrostatic field described by a potential function
$V(r)$. The total energy of the electron now consists of three
parts: the kinetic energy, the potential energy, and the energy
due to the interaction of the spin magnetic moment $-e\mathbf{s}/mc$ with
the orbital magnetic moment $-e\mathbf{L}/2mc$; the latter energy is called
the spin-orbit interaction energy. The kinetic and potential
energies are quantities with which we are familiar; but new
considerations are required for the spin-orbit interaction.

Now, the case of two bar magnets suggests that the spin-orbit
interaction energy should be proportional to the respective
magnitudes of the orbital magnetic moment and the spin magnetic
moment, and to the cosine of the angle between the two moments;
that is, because of the proportionality of the orbital magnetic
moment to $\mathbf{L}$ and the proportionality of the spin magnetic moment
to $\mathbf{s}$, we may expect the spin-orbit interaction energy to contain
the factor $\mathbf{L}\cdot\mathbf{s}$, that is, $L_xs_x + L_ys_y + L_zs_z$. Further, we would
expect this energy to depend on the distance of the electron from
the center of the field. The interaction energy would thus be
expected to have the form

$$\xi(r)\,\mathbf{L}\cdot\mathbf{s}, \tag{1}$$

where $\xi(r)$ is a function of $r$ whose form depends on the
potential field.

The intricate problem of evaluating the spin-orbit interaction
energy was considered in the light of classical analogy by Thomas
and by Frenkel,¹¹ and both concluded that this energy is given
by (1) with

$$\xi(r) = \frac{1}{2m^2c^2}\,\frac{1}{r}\,\frac{dV(r)}{dr}. \tag{2}$$

For example, in the case of the Coulomb potential $-e^2/r$, the
spin-orbit interaction energy is computed to be

$$\frac{e^2}{2m^2c^2}\,\frac{1}{r^3}\,\mathbf{L}\cdot\mathbf{s}. \tag{3}$$
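The step from (2) to (3) is a one-line differentiation, and it can be checked numerically. The sketch below evaluates $\xi(r)$ of (2) with a central finite difference standing in for $dV/dr$ and compares it with the factor $e^2/2m^2c^2r^3$ of (3). The unit values $e = m = c = 1$ and the step size are illustrative assumptions only.

```python
def xi(potential, r, m=1.0, c=1.0, h=1e-6):
    """xi(r) = (1 / (2 m^2 c^2)) * (1 / r) * dV/dr, with dV/dr estimated
    by a central finite difference of step h."""
    dv_dr = (potential(r + h) - potential(r - h)) / (2 * h)
    return dv_dr / (2 * m ** 2 * c ** 2 * r)


def coulomb(r, e=1.0):
    """Coulomb potential energy -e^2 / r."""
    return -e ** 2 / r


def xi_coulomb_closed_form(r, e=1.0, m=1.0, c=1.0):
    """Factor multiplying L.s in (3): e^2 / (2 m^2 c^2 r^3)."""
    return e ** 2 / (2 * m ** 2 * c ** 2 * r ** 3)


if __name__ == "__main__":
    for r in (0.5, 1.0, 2.0):
        print(r, xi(coulomb, r), xi_coulomb_closed_form(r))
```

The same function `xi` can be applied to any other central potential for which a closed form of $dV/dr$ is inconvenient.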


¹¹ L. H. Thomas, Nature, 117, 514 (1926). J. Frenkel, Zeits. f. Physik,
37, 243 (1926). Kemble, Section 58d.




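Recalling 3”, we now write our complete Hamiltonian operator as

$$H = -\frac{\hbar^2}{2m}\nabla^2 + V(r) + \xi(r)\,\mathbf{L}\cdot\mathbf{s}, \tag{4}$$

where

$$(24”) \qquad \nabla^2 = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) - \frac{L^2}{\hbar^2r^2}. \tag{5}$$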

Constants of motion and the two radial equations. The
operators

$$J_z, \qquad L^2, \qquad J^2, \tag{6}$$

which, according to Exercise $9^{86}$, commute with each other, do
not involve the operators $r$ or $\partial/\partial r$ and consequently commute
with operators involving only $r$ and $\partial/\partial r$. Further, since
$J^2 = (\mathbf{L} + \mathbf{s})^2 = L^2 + s^2 + 2(L_xs_x + L_ys_y + L_zs_z) = L^2 + \tfrac34\hbar^2 + 2\mathbf{L}\cdot\mathbf{s}$,
so that

$$2\mathbf{L}\cdot\mathbf{s} = J^2 - L^2 - \tfrac34\hbar^2, \tag{7}$$

the operators (6) also commute with the operator $\mathbf{L}\cdot\mathbf{s}$. Hence
$J_z$, $L^2$, and $J^2$ commute with every term in the Hamiltonian (4)
and, as already mentioned in §87, constitute a set of commuting
constants of motion in the Pauli theory of an electron in a central
field. In computing the eigenvalues of (4) we therefore may,
according to the expansion theorem, restrict ourselves to $\psi$'s that
are simultaneous eigen-$\psi$'s of $J_z$, $L^2$, and $J^2$, that is, to $\psi$'s of
the forms $27^{87}$ and $29^{87}$. In other words, instead of treating the
eigenvalue equation $H\psi = E\psi$ in its generality, we may restrict
ourselves to the simpler problem of studying separately the two
equations

$$H\psi(l, j = l + \tfrac12, m) = E\psi(l, j = l + \tfrac12, m) \tag{8}$$

and

$$H\psi(l, j = l - \tfrac12, m) = E\psi(l, j = l - \tfrac12, m). \tag{9}$$

Now,

$$H\psi(l, j = l + \tfrac12, m) = \left[-\frac{\hbar^2}{2m}\nabla^2 + V(r) + \xi(r)\,\mathbf{L}\cdot\mathbf{s}\right]\psi(l, j = l + \tfrac12, m), \tag{10}$$





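so that, in view of $30^{87}$ and the fact that our $\psi$ is an eigen-$\psi$ of $L^2$
belonging to the eigenvalue $l(l + 1)\hbar^2$,

$$H\psi = \left[-\frac{\hbar^2}{2m}\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{l(l + 1)\hbar^2}{2mr^2} + V(r) + \tfrac12 l\hbar^2\xi(r)\right]\psi(l, j = l + \tfrac12, m). \tag{11}$$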


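The bracketed operator in (11) is free from matrices and hence
does not mix the two components of $\psi$. Therefore, when we
introduce the explicit form $27^{87}$ for $\psi$, and (8) splits up into two
equations, the angle factors cancel out in each equation and both
equations reduce to the same radial equation, namely,

$$\left[-\frac{\hbar^2}{2m}\frac{1}{r^2}\frac{d}{dr}\left(r^2\frac{d}{dr}\right) + \frac{l(l + 1)\hbar^2}{2mr^2} + V(r) + \tfrac12 l\hbar^2\xi(r)\right]R = ER, \qquad j = l + \tfrac12, \quad l = 0, 1, 2, \ldots. \tag{12}$$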


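Treating (9) in a similar way, we get another radial equation,
namely,

$$\left[-\frac{\hbar^2}{2m}\frac{1}{r^2}\frac{d}{dr}\left(r^2\frac{d}{dr}\right) + \frac{l(l + 1)\hbar^2}{2mr^2} + V(r) - \tfrac12(l + 1)\hbar^2\xi(r)\right]R = ER, \qquad j = l - \tfrac12, \quad l = 1, 2, 3, \ldots. \tag{13}$$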


The energy levels of our system are thus the values of $E$ which
yield well-behaved solutions of (12) and also the values of $E$
which yield well-behaved solutions of (13).

Apart from the terms in $\xi(r)$, Equations (12) and (13) are both
identical with the Schroedinger radial equation

$$\left[-\frac{\hbar^2}{2m}\frac{1}{r^2}\frac{d}{dr}\left(r^2\frac{d}{dr}\right) + \frac{l(l + 1)\hbar^2}{2mr^2} + V(r)\right]R = ER. \tag{14}$$

Since spin effects are known to be small, we may consider the
terms in $\xi(r)$ as perturbing potentials and may handle (12) and
(13) by perturbation methods.

The spin corrections for hydrogen. If $\psi(n\,l\,m_l)$ is the normalized
Schroedinger eigenfunction 25®° of the Hamiltonian of
hydrogen, then¹²

$$\int \bar\psi(n\,l\,m_l)\,r^{-3}\,\psi(n\,l\,m_l)\,d\tau = \frac{1}{n^3\,l(l + \tfrac12)(l + 1)\,a_0^3}, \tag{15}$$

where $a_0$ is the Bohr radius 2l“. For hydrogen, $\xi(r) = e^2/2m^2c^2r^3$,
so that for the hydrogen state $\psi(n\,l\,m_l)$ the average value of the
quantity $\tfrac12 l\hbar^2\xi(r)$, that is, of the perturbing potential in (12), is

$$\frac{l\hbar^2}{2}\,\frac{e^2}{2m^2c^2}\,\frac{1}{n^3\,l(l + \tfrac12)(l + 1)\,a_0^3} = \frac{|E_n|\alpha^2}{4n^2}\,\frac{4n}{(2l + 1)(l + 1)}, \tag{16}$$

where $E_n$ is the $n$th Bohr level 18®° and $\alpha$ is the fine-structure
constant 6°®. Similarly, for the hydrogen state $\psi(n\,l\,m_l)$, the
average value of $-\tfrac12(l + 1)\hbar^2\xi(r)$, that is, of the perturbing
potential in (13), is

$$-\frac{|E_n|\alpha^2}{4n^2}\,\frac{4n}{l(2l + 1)}. \tag{17}$$

Now, in three-dimensional perturbed motion, just as in the
one-dimensional case treated in §36, the first-order energy correction
for a nondegenerate perturbed level equals the average of
the perturbing potential for the corresponding unperturbed state.
A closer study shows that this simple result holds in our present
case in spite of the degeneracy of the Bohr levels; and consequently
the eigenvalue of (12) which belongs to the state
$\psi(n, l, j = l + \tfrac12, m)$, which we denote by $E(n, l, j = l + \tfrac12)$,
can be obtained in the hydrogen case to the first approximation¹³
by adding the quantity (16) to the energy $E_n$ of the Schroedinger
state $\psi(n\,l\,m_l)$. Treating (13) in a similar way, we get our final
results:


$$E(n, l, j = l + \tfrac12) = E_n + \frac{|E_n|\alpha^2}{4n^2}\,\frac{4n}{(2l + 1)(l + 1)} \tag{18a}$$

$$E(n, l, j = l - \tfrac12) = E_n - \frac{|E_n|\alpha^2}{4n^2}\,\frac{4n}{l(2l + 1)}. \tag{18b}$$
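To see the pattern of levels implied by (18a) and (18b), it is convenient to tabulate the spin corrections in units of $|E_n|\alpha^2/4n^2$. The sketch below does this with exact fractions; the function name `spin_correction` and the choice of units are ours.

```python
from fractions import Fraction

HALF = Fraction(1, 2)


def spin_correction(n, l, j):
    """Spin correction of (18a) or (18b), in units of |E_n| alpha^2 / (4 n^2)."""
    if j == l + HALF:
        return Fraction(4 * n, (2 * l + 1) * (l + 1))
    if j == l - HALF and l >= 1:
        return -Fraction(4 * n, l * (2 * l + 1))
    raise ValueError("j must be l + 1/2, or l - 1/2 with l >= 1")


if __name__ == "__main__":
    n = 3
    for l in range(n):                       # s, p, d sublevels of n = 3
        for j in (l + HALF, l - HALF):
            if j > 0:
                print(f"n={n} l={l} j={j}: {spin_correction(n, l, j)}")
```

The tabulation shows each s-level raised and every other level split into a doublet, the sublevel with $j = l + \tfrac12$ lying above the one with $j = l - \tfrac12$.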


The levels (18) are shown in Fig. 111, the left portion of which
is a reproduction of Fig. 107°®. The values of $j$ are indicated at
the right of the levels; the vertical lines connect the sublevels
with their parent Bohr levels. Note that each s-level is raised and
each of the other levels is split up into a doublet; each sublevel
is of course degenerate with respect to $m$. The arrangement of the
levels is different from that required by experiment, and their
number is almost twice too great. The spin corrections alone are
thus insufficient to bring agreement with experiment; but we shall
see in the next section that we do get the desired results when the
spin corrections are combined with the relativity corrections.

¹² Condon and Shortley, page 117.

¹³ Here 'to the first approximation' means 'correctly to terms in $1/c^2$.'

Fig. 111. Spin corrections for hydrogen. The Sommerfeld levels are shown
at the left. The odd half-integers are the values of $j$.

The s-levels. The quantity $\mathbf{L}\cdot\mathbf{s}$ is zero whenever $L = 0$, so that
the spin-orbit interaction energy (3) is zero whenever $L = 0$. Since
$L = 0$ for s-states, the formula (3) thus implies that the spin-orbit
interaction energy is zero for s-states, that is, that the
spin causes no displacement in s-levels. This conclusion is borne
out by (12), which becomes identical with the Schroedinger radial
equation (14) whenever $l = 0$.

Yet (18a) gives a nonvanishing spin correction even when
$l = 0$; and, what is more, the correction it gives when $l = 0$ is
correct. Two questions therefore arise: Why does (18a) give a
nonvanishing correction when $l = 0$? And why does (18a) give
the correct nonvanishing correction when $l = 0$? The answer to
the first question lies in the fact that the average value of $r^{-3}$
given by (15) contains the factor $l$ in the denominator, and that
in getting the correction (16) we have canceled this $l$ with the $l$
that multiplies $\xi(r)$ in (12). To the second question there is
perhaps no satisfactory answer, and we may summarize the situation
as follows: the Pauli method fails when applied specifically
to an s-state; but, perhaps fortuitously, the correct spin correction
for an s-state can nevertheless be found by extrapolating to
the case $l = 0$ the Pauli results computed for $l > 0$.

Exercises 

1. Derive the radial equation (13). 

2. Extend Fig. 111 through $n = 5$.


89. The Spin-and-Relativity Correction for Hydrogen 

We now turn to a Pauli electron moving relativistically in a
Coulomb field. One way of treating this problem is to replace
the nonrelativistic Pauli Hamiltonian $4^{88}$ by the appropriate
relativistic Pauli Hamiltonian and to compute the eigenvalues of the
latter.¹⁴ But the simpler way, which we shall follow, is merely
to combine the relativity corrections for the Schroedinger hydrogen
atom, found in §83 to be

$$(16^{83}) \qquad -\frac{|E_n|\alpha^2}{4n^2}\left(\frac{4n}{l + \tfrac12} - 3\right) + mc^2, \tag{1}$$

with the Pauli spin corrections for the Schroedinger hydrogen
atom computed in §88.

In the case $j = l + \tfrac12$, the spin correction is

$$(18a^{88}) \qquad \frac{|E_n|\alpha^2}{4n^2}\,\frac{4n}{(2l + 1)(l + 1)}. \tag{2}$$

The sum of (1) and (2) is

$$-\frac{|E_n|\alpha^2}{4n^2}\left(\frac{4n}{l + 1} - 3\right) + mc^2, \tag{3}$$

that is, since $j = l + \tfrac12$,

$$-\frac{|E_n|\alpha^2}{4n^2}\left(\frac{4n}{j + \tfrac12} - 3\right) + mc^2. \tag{4}$$

¹⁴ Compare C. G. Darwin, Proc. Roy. Soc., 116A, 227 (1927).



In the case $j = l - \tfrac12$ the sum of the relativity correction (1)
and the spin correction given by $18b^{88}$ is

$$-\frac{|E_n|\alpha^2}{4n^2}\left(\frac{4n}{l} - 3\right) + mc^2, \tag{5}$$

so that we get the expression (4) once again. Thus the spin-and-relativity
corrections depend only on $j$ and, except for the limitations
that the value of $l$ places on the values of $j$, are independent
of $l$.
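That the combined correction depends on $j$ alone is easily confirmed by exact arithmetic. The following sketch adds the relativity correction (1), with the rest energy $mc^2$ omitted, to the spin corrections of §88 and compares the sum with (4), all in units of $|E_n|\alpha^2/4n^2$; the function names are hypothetical.

```python
from fractions import Fraction

HALF = Fraction(1, 2)


def relativity_correction(n, l):
    """Correction (1) without m c^2, in units of |E_n| alpha^2 / (4 n^2)."""
    return -(Fraction(4 * n) / (l + HALF) - 3)


def spin_correction(n, l, j):
    """Spin corrections (18a), (18b) of Section 88, in the same units."""
    if j == l + HALF:
        return Fraction(4 * n, (2 * l + 1) * (l + 1))
    if j == l - HALF and l >= 1:
        return -Fraction(4 * n, l * (2 * l + 1))
    raise ValueError("j must be l + 1/2, or l - 1/2 with l >= 1")


def combined(n, l, j):
    """Sum of the relativity and spin corrections."""
    return relativity_correction(n, l) + spin_correction(n, l, j)


def formula_4(n, j):
    """Expression (4) without m c^2: depends on n and j only."""
    return -(Fraction(4 * n) / (j + HALF) - 3)


if __name__ == "__main__":
    # n = 3, j = 3/2 arises both from l = 1 and from l = 2.
    j = Fraction(3, 2)
    print(combined(3, 1, j), combined(3, 2, j), formula_4(3, j))
```

Sublevels with the same $n$ and $j$ but different $l$ therefore coincide, which is what reduces the number of distinct levels to that of the Sommerfeld formula.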

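Adding (4) to Bohr's $E_n$ and denoting the resulting energy
levels by $E_{nj}$, we finally get

$$E_{nj} = E_n - \frac{|E_n|\alpha^2}{4n^2}\left(\frac{4n}{j + \tfrac12} - 3\right) + mc^2. \tag{6}$$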


Since $j = \tfrac12, \tfrac32, \tfrac52, \ldots$, this formula is identical with the
Sommerfeld formula 7®", in which $k = 1, 2, 3, \ldots$. We conclude that,
when the Schroedinger theory of the isolated hydrogen atom is
corrected to terms in $1/c^2$ for both the relativity effects and the
spin effects, its results are in agreement (to terms in $1/c^2$) with
experiment insofar as the possible energies of the atom are concerned.

The energy levels required by (6) are shown, apart from the term
$mc^2$, in Fig. 112, whose left portion is a reproduction of Fig. 107.
Comparing this figure with Fig. 111, we note, in particular, that the
relativity corrections shift adjacent spin doublets in such a way as
to reduce the number of levels to that required by the Sommerfeld
formula; but the number of basic states is still almost twice as
great as in the Sommerfeld theory, so that our present hydrogen atom
responds to external perturbing fields somewhat differently from the
Sommerfeld hydrogen atom.

Fig. 112. Spin-and-relativity corrections for hydrogen. The Sommerfeld
levels are shown at the left. The half-odd integers are the values of $j$.

Exercises 

1. Extend Fig. 112 through n = 5. 

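2. Show that the operators $\mathbf{L}\cdot\mathbf{s} + \tfrac12\hbar^2$ (where $\mathbf{L}\cdot\mathbf{s} = L_xs_x + L_ys_y + L_zs_z$)
and $\boldsymbol{\sigma}\cdot\mathbf{p}$ ($= \sigma_xp_x + \sigma_yp_y + \sigma_zp_z$) anticommute with each other, and that
$\hbar^2(2\hbar^{-2}\mathbf{L}\cdot\mathbf{s} + 1)^2 = J^2 + \tfrac14\hbar^2$. Note that commutators such as $L_xp_y - p_yL_x$
can be found, except for the factor $i\hbar$, in Table 17^^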

90. Space Functions and Spin Functions 

In many applications of Pauli's theory it is convenient to use a
notation that is less explicit but more concise than our previous
notation.

Every instantaneous Pauli $\psi$ has the form

$$\psi = \begin{pmatrix} f \\ g \end{pmatrix} \tag{1}$$

and can therefore be written as

$$\psi = \begin{pmatrix} 1 \\ 0 \end{pmatrix}f + \begin{pmatrix} 0 \\ 1 \end{pmatrix}g, \tag{2}$$

where $f$ and $g$ are functions of the coordinates and are called
space functions. Replacing the coefficients of $f$ and $g$ in (2) by
the single letters $\alpha$ and $\beta$, we now rewrite (2) as

$$\psi = \alpha f + \beta g \tag{3}$$

and call $\alpha$ and $\beta$ spin functions.

In order to be able to manipulate Pauli $\psi$'s written in the
form (3) so as to get the same results that are obtained when the
more explicit form (2) is used, we must develop rules for handling
the spin functions $\alpha$ and $\beta$. In the first place, we must have the
rule

$$(\alpha f_1 + \beta g_1) + (\alpha f_2 + \beta g_2) = \alpha(f_1 + f_2) + \beta(g_1 + g_2). \tag{4}$$

Since the two column symbols in (2) are eigenvectors of $\sigma_z$
belonging, respectively, to the eigenvalues 1 and $-1$ of $\sigma_z$, we
require further that

$$\sigma_z(\alpha f + \beta g) = \alpha f - \beta g. \tag{5}$$

Similarly

$$\sigma_x(\alpha f + \beta g) = \alpha g + \beta f \tag{6}$$





and

$$\sigma_y(\alpha f + \beta g) = -i\alpha g + i\beta f. \tag{7}$$

To take care of the normalization and the orthogonality of the
two column symbols in (2) we introduce the equations

$$\alpha^2 = 1, \qquad \beta^2 = 1, \qquad \alpha\beta = 0, \tag{8}$$

although the equations $\bar\alpha\alpha = 1$, $\bar\beta\beta = 1$, $\bar\alpha\beta = \bar\beta\alpha = 0$ would
perhaps be more appropriate. Since operators not involving the
$\sigma$'s operate only on the space functions, we have

$$L_z(\alpha f + \beta g) = \alpha L_zf + \beta L_zg, \tag{9}$$

and so on.
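The rules (4) to (7) are easily mechanized. In the sketch below a Pauli $\psi = \alpha f + \beta g$ is represented by the pair `(f, g)` of values of the space functions at a single point; this representation and the function names are our own illustrative choices. The familiar relation $\sigma_x\sigma_y = i\sigma_z$ then serves as a check that the rules reproduce the matrix algebra.

```python
def add(psi1, psi2):
    """Rule (4): add the space functions belonging to alpha and to beta."""
    return (psi1[0] + psi2[0], psi1[1] + psi2[1])


def sigma_x(psi):
    """Rule (6): sigma_x(alpha f + beta g) = alpha g + beta f."""
    f, g = psi
    return (g, f)


def sigma_y(psi):
    """Rule (7): sigma_y(alpha f + beta g) = -i alpha g + i beta f."""
    f, g = psi
    return (-1j * g, 1j * f)


def sigma_z(psi):
    """Rule (5): sigma_z(alpha f + beta g) = alpha f - beta g."""
    f, g = psi
    return (f, -g)


if __name__ == "__main__":
    psi = (2 + 1j, -3j)   # values of f and g at some point (arbitrary numbers)
    print(sigma_x(sigma_y(psi)))
    print(tuple(1j * component for component in sigma_z(psi)))
```

The two printed pairs agree, so that handling $\alpha$ and $\beta$ by the rules above gives the same results as the explicit two-component form.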

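To illustrate the new notation, we shall recompute the eigen-$\psi$'s
of $J_z$. The eigenvalue equation $2^{87}$, $J_z\psi = m\hbar\psi$, now reads
$J_z(\alpha f + \beta g) = m\hbar(\alpha f + \beta g)$, that is,

$$(L_z + \tfrac12\hbar\sigma_z)(\alpha f + \beta g) = m\hbar(\alpha f + \beta g). \tag{10}$$

Using the rules for handling $\alpha$ and $\beta$, we reduce (10) to

$$\alpha L_zf + \beta L_zg + \tfrac12\hbar\alpha f - \tfrac12\hbar\beta g = \alpha m\hbar f + \beta m\hbar g \tag{11}$$

and then to

$$\alpha(L_z - m\hbar + \tfrac12\hbar)f + \beta(L_z - m\hbar - \tfrac12\hbar)g = 0. \tag{12}$$

Multiplying (12) through separately by $\alpha$ and by $\beta$ and recalling
(8), we now get the two equations

$$(L_z - m\hbar + \tfrac12\hbar)f = 0, \qquad (L_z - m\hbar - \tfrac12\hbar)g = 0, \tag{13}$$

which are just the equations $4^{87}$. From this point the procedure
is the same as in §87, but the final result appears in the form

$$\psi(m) = \alpha A_1(r, \theta)e^{i(m-\frac12)\varphi} + \beta A_2(r, \theta)e^{i(m+\frac12)\varphi} \tag{14}$$

rather than in the equivalent form $8^{87}$.

Exercise

1. Compute the simultaneous eigen-$\psi$'s of $J_z$, $L^2$, and $J^2$ using the
notation of this section.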



Chapter XIV 


ELEMENTS OF DIRAC'S THEORY OF THE ELECTRON

We finally turn to the highest refinement of quantum me- 
chanics which is at present available, namely, the Dirac theory of 
the electron. In order to be able to present the elements of this 
theory in a particularly simple way, we shall restrict ourselves to 
the case of an electron moving in a purely electrostatic external 
field. Now, a field that is purely electrostatic from the stand- 
point of one observer may involve a magnetic field from the 
standpoint of a second observer who moves relative to the first. 
Consequently our restriction is a serious one; in particular, we 
shall have to forego the discussion of one of the most important 
features of the Dirac theory, namely, its invariance under the 
so-called Lorentz transformations of the relativity theory. 

It should perhaps be emphasized that Dirac's theory is specifically
a theory of the electron, rather than of any other particle.

The manner in which Pauli's theory forms an approximation to
Dirac's is discussed in Exercise 

91. The Dirac Equation for an Electron in an Electrostatic Field 

The relativistic equation that connects the total energy $E$, the
potential energy $V = V(x, y, z)$, and the components of linear
momentum $p_x$, $p_y$, and $p_z$ of a particle of rest mass $m$ is

$$(1“) \qquad \frac{1}{c^2}(E - V)^2 = p_x^2 + p_y^2 + p_z^2 + m^2c^2. \tag{1}$$

Solving (1) for $E$, and then writing $H$ for $E$, we get the Hamiltonian

$$H = V \pm c\sqrt{p_x^2 + p_y^2 + p_z^2 + m^2c^2}. \tag{2}$$

A reference to (1) shows that the upper sign in (2) corresponds to
positive values of the kinetic energy $E - V$, while the lower
sign yields negative values of the kinetic energy; in classical
relativity the second alternative is of no physical interest and
can be disregarded.





506 DIRAC'S THEORY OF THE ELECTRON §91 


If we associate the appropriate Schroedinger operators with
the quantities on the right side of (2), we get the operator

$$H = V \pm c\sqrt{-\hbar^2\frac{\partial^2}{\partial x^2} - \hbar^2\frac{\partial^2}{\partial y^2} - \hbar^2\frac{\partial^2}{\partial z^2} + m^2c^2}. \tag{3}$$

This operator is complicated at best: if it should be used, the
equation $H\psi = E\psi$ would no longer be a differential equation of
finite order. The fundamental difficulty, however, is that adopting
this operator makes it impossible to construct expressions for
probability current and probability density which satisfy a
relativistic equation for the conservation of probability.

Now, in §83, we avoided the operator (3) by not solving (1)
explicitly for $E$; but this procedure is also unsatisfactory in
several respects. For example, it prevents us from determining
the time dependence of $\psi$-functions directly through the second
Schroedinger equation $H\psi = i\hbar\,\partial\psi/\partial t$. The straightforward way
out of this particular difficulty is as follows: We recall that an
eigenfunction of $H$ satisfies simultaneously the two equations
$H\psi = E\psi$ and $H\psi = i\hbar\,\partial\psi/\partial t$, so that insofar as an eigenfunction
of $H$ is concerned the number $E$ can be replaced by the operator
$i\hbar\,\partial/\partial t$; we then replace $E$ by $i\hbar\,\partial/\partial t$ in 5®® and get a differential
equation that fixes the time dependence of $\psi$. But, as inspection
of 5®® will show, the equation so obtained is of the second order
in $\partial/\partial t$, and hence the knowledge of the $\psi$-function pertaining to
the instant $t = 0$ is insufficient for the computation of $\psi$ for a
later instant of time; to compute $\psi$ for all time we would, in fact,
have to know not only $\psi$ for $t = 0$ but also $\partial\psi/\partial t$ for $t = 0$. This
feature is of course in direct contradiction to the quantum-mechanical
method.

The problem of formulating a relativistic quantum mechanics
thus leads us directly into an impasse: Equation (1) is unsatisfactory
for quantum-mechanical purposes whether or not we solve
it explicitly for $E$. The way out was found by Dirac: to get the
correct Hamiltonian, we must free the Hamiltonian (2) from the
radical that it now contains. This idea, which in retrospect seems
simple, led to one of the most important equations of all quantum
theory.

To free (2) from the radical we must
express the quantity $p_x^2 + p_y^2 + p_z^2 + m^2c^2$ as a perfect square;




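and perhaps the simplest possibility that preserves the symmetry
between $p_x$, $p_y$, $p_z$, and $mc$ is to write

$$p_x^2 + p_y^2 + p_z^2 + m^2c^2 = (\alpha_xp_x + \alpha_yp_y + \alpha_zp_z + \alpha_mmc)^2, \tag{4}$$

where the coefficients $\alpha_x$, $\alpha_y$, $\alpha_z$, and $\alpha_m$ are yet to be determined.

Now, we obviously cannot satisfy (4) by taking the $\alpha$'s to be
ordinary numbers; in fact we have

$$\begin{aligned}
(\alpha_xp_x + \alpha_yp_y + \alpha_zp_z + \alpha_mmc)^2
= {} & \alpha_x^2p_x^2 + \alpha_x\alpha_yp_xp_y + \alpha_x\alpha_zp_xp_z + \alpha_x\alpha_mp_xmc \\
& + \alpha_y\alpha_xp_yp_x + \alpha_y^2p_y^2 + \alpha_y\alpha_zp_yp_z + \alpha_y\alpha_mp_ymc \\
& + \alpha_z\alpha_xp_zp_x + \alpha_z\alpha_yp_zp_y + \alpha_z^2p_z^2 + \alpha_z\alpha_mp_zmc \\
& + \alpha_m\alpha_xmcp_x + \alpha_m\alpha_ymcp_y + \alpha_m\alpha_zmcp_z + \alpha_m^2m^2c^2,
\end{aligned} \tag{5}$$

so that if the $\alpha$'s are ordinary numbers the right side of (4) involves
cross-products of the $p$'s which do not appear on the left.
But we note that if we take the $\alpha$'s to be operators which
anticommute with one another, so that

$$\begin{aligned}
\alpha_x\alpha_y &= -\alpha_y\alpha_x, & \alpha_x\alpha_z &= -\alpha_z\alpha_x, & \alpha_x\alpha_m &= -\alpha_m\alpha_x, \\
\alpha_y\alpha_z &= -\alpha_z\alpha_y, & \alpha_y\alpha_m &= -\alpha_m\alpha_y, & \alpha_z\alpha_m &= -\alpha_m\alpha_z,
\end{aligned} \tag{6}$$

then, since the $p$'s commute with one another, the cross-products
cancel out and the right side of (4) reduces to
$\alpha_x^2p_x^2 + \alpha_y^2p_y^2 + \alpha_z^2p_z^2 + \alpha_m^2m^2c^2$; and if we require further that

$$\alpha_x^2 = \alpha_y^2 = \alpha_z^2 = \alpha_m^2 = 1, \tag{7}$$

then (4) is satisfied in every detail. We therefore conclude that, if
the $\alpha$'s are operators satisfying (6) and (7), then the square root of
$p_x^2 + p_y^2 + p_z^2 + m^2c^2$ can be written as $\alpha_xp_x + \alpha_yp_y + \alpha_zp_z + \alpha_mmc$
and that, adopting the lower sign in (2), we can write (2) as

$$H = V - c\alpha_xp_x - c\alpha_yp_y - c\alpha_zp_z - \alpha_mmc^2. \tag{8}$$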

A closer study shows that the a's should be linear operators, 
independent of the coordinates, of the p’s, and of t; but apart 
from this no further restrictions are placed on the a’s. Note that 
the replacement of the a’s by — a’s does not alter (6) and (7), 
so that it is immaterial whether we take the upper or the lower 
sign in (2); this conclusion implies that in Dirac's theory we 





cannot restrict ourselves from the outset to states of positive 
kinetic energy. 

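Replacing the $p$'s in (8) by the appropriate Schroedinger operators,
we get the operator

$$H = V + i\hbar c\alpha_x\frac{\partial}{\partial x} + i\hbar c\alpha_y\frac{\partial}{\partial y} + i\hbar c\alpha_z\frac{\partial}{\partial z} - \alpha_mmc^2. \tag{9}$$

This linear operator is the celebrated Dirac Hamiltonian for an
electron in an electrostatic field.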

The matrix $\alpha$'s and the four-component $\psi$'s. Although the
theory does not require that we ascribe specific mathematical
forms to the $\alpha$'s, in practice it is usually expedient to do so. We
therefore proceed to identify a set of four linear operators that
satisfy (6) and (7), and that do not depend on the coordinates,
the $p$'s, and $t$; this problem is like that of finding specific operators
satisfying the Pauli conditions 3®® and 6^^, except that now we look
for four anticommuting operators. Just as in Pauli's case, it is
convenient to take our operators to be matrices; we find, however,
that no four 2-by-2 matrices will anticommute with one
another, and that 3-by-3 matrices also will not suit our purposes.
Turning to 4-by-4 matrices, we find a variety of possibilities, of
which the following is adopted by Dirac:

$$\alpha_x = \begin{pmatrix} 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \end{pmatrix}, \qquad
\alpha_y = \begin{pmatrix} 0 & 0 & 0 & -i \\ 0 & 0 & i & 0 \\ 0 & -i & 0 & 0 \\ i & 0 & 0 & 0 \end{pmatrix},$$

$$\alpha_z = \begin{pmatrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & -1 \\ 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \end{pmatrix}, \qquad
\alpha_m = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}. \tag{10}$$


An advanced argument shows that any other choice of the a's 
will lead to the same physical conclusions, even if we use matrices 
of more than four rows and columns. 
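That the matrices (10) do satisfy the conditions (6) and (7) can be verified by direct multiplication. The sketch below carries out the sixteen products with plain nested lists; the helper names are ours.

```python
ALPHA_X = [[0, 0, 0, 1], [0, 0, 1, 0], [0, 1, 0, 0], [1, 0, 0, 0]]
ALPHA_Y = [[0, 0, 0, -1j], [0, 0, 1j, 0], [0, -1j, 0, 0], [1j, 0, 0, 0]]
ALPHA_Z = [[0, 0, 1, 0], [0, 0, 0, -1], [1, 0, 0, 0], [0, -1, 0, 0]]
ALPHA_M = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
ALPHAS = {"x": ALPHA_X, "y": ALPHA_Y, "z": ALPHA_Z, "m": ALPHA_M}


def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def anticommutator(a, b):
    """Return ab + ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] + ba[i][j] for j in range(len(a))] for i in range(len(a))]


def satisfies_6_and_7():
    """True if ab + ba equals 2 times the unit matrix when a = b
    (condition (7)) and vanishes when a differs from b (condition (6))."""
    for name_a, a in ALPHAS.items():
        for name_b, b in ALPHAS.items():
            result = anticommutator(a, b)
            for i in range(4):
                for j in range(4):
                    expected = 2 if (name_a == name_b and i == j) else 0
                    if abs(result[i][j] - expected) > 1e-12:
                        return False
    return True


if __name__ == "__main__":
    print(satisfies_6_and_7())
```

Replacing any one of the four matrices by its negative leaves the outcome unchanged, in keeping with the remark that the sign in (2) is immaterial.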

Now, if the Dirac operators are to involve 4-by-4 matrices,
then the Dirac operands must have four components; that is, a
Dirac $\psi$-function must have the form

$$\psi = \begin{pmatrix} \psi_1 \\ \psi_2 \\ \psi_3 \\ \psi_4 \end{pmatrix}, \tag{11}$$



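where each of the four components is an ordinary function of
$x$, $y$, $z$, and $t$. These $\psi$'s, which, except for the doubling of the
number of components, are like the Pauli $\psi$'s, are manipulated
in essentially the same way as the Pauli $\psi$'s. In particular, the
Dirac $\bar\psi$ corresponding to (11) is the row symbol

$$(16”) \qquad \bar\psi = (\bar\psi_1, \bar\psi_2, \bar\psi_3, \bar\psi_4) \tag{12}$$

and the corresponding Dirac $\bar\psi\psi$ is the following ordinary function:

$$(17®®) \qquad \bar\psi\psi = \bar\psi_1\psi_1 + \bar\psi_2\psi_2 + \bar\psi_3\psi_3 + \bar\psi_4\psi_4. \tag{13}$$

Two Dirac $\psi$'s, $\psi^{\mathrm{I}}$ and $\psi^{\mathrm{II}}$, are said to be orthogonal to each other if
$\int \bar\psi^{\mathrm{I}}\psi^{\mathrm{II}}\,d\tau = 0$, that is, if
$\int (\bar\psi_1^{\mathrm{I}}\psi_1^{\mathrm{II}} + \bar\psi_2^{\mathrm{I}}\psi_2^{\mathrm{II}} + \bar\psi_3^{\mathrm{I}}\psi_3^{\mathrm{II}} + \bar\psi_4^{\mathrm{I}}\psi_4^{\mathrm{II}})\,d\tau = 0$.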

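The time dependence of a Dirac $\psi$ pertaining to a system whose
Hamiltonian is $H$ is determined through the equation

$$H\psi = i\hbar\frac{\partial\psi}{\partial t}, \tag{14}$$

where $H$ is the Hamiltonian operator (9) and where

$$(36”) \qquad \frac{\partial\psi}{\partial t} = \frac{\partial}{\partial t}\begin{pmatrix} \psi_1 \\ \psi_2 \\ \psi_3 \\ \psi_4 \end{pmatrix} = \begin{pmatrix} \partial\psi_1/\partial t \\ \partial\psi_2/\partial t \\ \partial\psi_3/\partial t \\ \partial\psi_4/\partial t \end{pmatrix}. \tag{15}$$

Note that (14) is formally the same as the second Schroedinger
equation.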

When Dirac's $\alpha$'s have the explicit forms (10), the Dirac
operators associated with 'ordinary' dynamical variables are obtained
by throwing the corresponding Schroedinger operators into
a matrix form after the manner of 19”, except that now the 4-by-4
unit matrix is used in the transformation.

It is a fundamental assumption of the Dirac theory that the
average value of a dynamical variable $a$ for a state $\psi$ is given by
our familiar expectation formula

$$(8") \qquad \text{av}\ a = \frac{\int \bar\psi a\psi\,d\tau}{\int \bar\psi\psi\,d\tau}, \tag{16}$$

where $a$ in the integrand is now the Dirac operator associated
with the dynamical variable $a$. Note that the $a$ in (16) is any
dynamical variable, and not necessarily one of the subscripted
$\alpha$'s in (10).





The equation of motion of a dynamical variable not involving $t$
explicitly retains in the Dirac theory the form 16^®.

The Dirac equation. The equation $H\psi = E\psi$, with $H$ given
by (8), that is, the equation

$$(V - c\alpha_xp_x - c\alpha_yp_y - c\alpha_zp_z - \alpha_mmc^2)\psi = E\psi, \tag{17}$$

with $p_x = -i\hbar\,\partial/\partial x$, and so on, is called the Dirac equation for an
electron in an electrostatic field. The computation of the energy
levels of an electron moving relativistically in an electrostatic
field consists in Dirac's theory in the calculation of the numerical
values of the parameter $E$ for which (17) has well-behaved
solutions.

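To rewrite the Dirac equation in a more explicit form, we
replace the $\alpha$'s by the specific matrices (10), replace $\psi$ by a
four-component column symbol, and, for uniformity, multiply $V$ by
the 4-by-4 unit matrix. The result is

$$\begin{pmatrix}
V - mc^2 & 0 & -cp_z & -c(p_x - ip_y) \\
0 & V - mc^2 & -c(p_x + ip_y) & cp_z \\
-cp_z & -c(p_x - ip_y) & V + mc^2 & 0 \\
-c(p_x + ip_y) & cp_z & 0 & V + mc^2
\end{pmatrix}
\begin{pmatrix} \psi_1 \\ \psi_2 \\ \psi_3 \\ \psi_4 \end{pmatrix}
= E\begin{pmatrix} \psi_1 \\ \psi_2 \\ \psi_3 \\ \psi_4 \end{pmatrix}, \tag{18}$$

so that (17) reduces to the four simultaneous equations

$$\begin{aligned}
(V - mc^2)\psi_1 - cp_z\psi_3 - c(p_x - ip_y)\psi_4 &= E\psi_1 \\
(V - mc^2)\psi_2 + cp_z\psi_4 - c(p_x + ip_y)\psi_3 &= E\psi_2 \\
(V + mc^2)\psi_3 - cp_z\psi_1 - c(p_x - ip_y)\psi_2 &= E\psi_3 \\
(V + mc^2)\psi_4 + cp_z\psi_2 - c(p_x + ip_y)\psi_1 &= E\psi_4.
\end{aligned} \tag{19}$$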


Finally, we replace px by —ih dJdXj and so on, and after a slight 
rearrangement get the equations 

i ^ + (I - * ^ 
i V’ ^ ^ ° 

rl “ ) ^4 "I" I — t" ^ T ~ ) ~ O' 

[h\ c \dx dy/^ dz^ 


(18) 


( 20 ) 





Thus the single symbolic Dirac equation (17) amounts to the four explicit simultaneous partial differential equations (20). And, to determine the energy levels of an electron moving relativistically in an electrostatic field in which the potential energy of the electron is $V = V(x, y, z)$, we must compute the numerical values of the parameter $E$ for which the system of equations (20) has well-behaved solutions; by 'well-behaved' we mean here that at least one of the component functions $\psi_1$, $\psi_2$, $\psi_3$, and $\psi_4$ is different from zero, and that each of the nonvanishing component functions is well-behaved in the sense of the Schroedinger theory.

Exercises 

1. Show that according to (7) the only eigenvalues of $\alpha_x$, $\alpha_y$, $\alpha_z$, and $\alpha_m$ are 1 and $-1$, and verify in detail that the particular $\alpha$'s given by (10) have just these eigenvalues.

2. Use (9) and 16^® to show that the operator associated in Dirac's theory with $\dot p_x$, that is, with the $x$-component of the force acting on an electron in an electrostatic field, is $-\partial V/\partial x$.

3. Use (9) and 16^® to show that the operator associated in Dirac's theory with the $x$-component $\dot x$ of the electron velocity is $-c\alpha_x$ and that consequently the only possible results of a precise measurement of a component of the electron velocity are $c$ and $-c$. The fact that $m\dot x$ is not equal to $p_x$ is in line with classical relativity (for example, relativistically, the absolute value of $\dot x$ cannot exceed $c$, while $p_x$ can have any real value whatever); but the result stated in italics is new.¹

4. Show that each of the two matrices $\pm\alpha_x\alpha_y\alpha_z\alpha_m$ anticommutes with every $\alpha$ in (10) and becomes 1 when squared.

5. Show that $\pm\alpha_x\alpha_y\alpha_z\alpha_m$ are the only 4-by-4 matrices that anticommute with every $\alpha$ in (10) and become 1 when squared.

6. To verify in a small way that the properties of the Dirac Hamiltonian (9) do not depend in any essential way on the choice of the $\alpha$'s, provided Equations (6) and (7) are satisfied, replace $\alpha_z$ in (10) by its negative and, replacing the symbol $\psi$ in (17) by $\psi'$, set up the equations that then take the place of (20); then show that these equations become identical with (20) if the symbols $\psi'_1$, $\psi'_2$, $\psi'_3$, and $\psi'_4$ are replaced, respectively, by $-\psi_1$, $\psi_2$, $\psi_3$, and $-\psi_4$.

7. Relate the result of Exercise 6 to the fact that when $\alpha_z$ in (9) is replaced by $-\alpha_z$ the resulting operator is the transform of (9) by the matrix $i\alpha_x\alpha_y\alpha_m$.


¹ G. Breit, Proc. Nat. Acad. Sci., 14, 553 (1928). See also E. L. Hill and R. Landshoff, Rev. Mod. Phys., 10, 116 (1938).





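92. The Dirac σ's

In some computations it is convenient to rewrite the Dirac Hamiltonian 8⁹¹ in a slightly different form. We introduce the five matrices

$$\sigma_x = \begin{pmatrix}0&1&0&0\\1&0&0&0\\0&0&0&1\\0&0&1&0\end{pmatrix}, \quad \sigma_y = \begin{pmatrix}0&-i&0&0\\i&0&0&0\\0&0&0&-i\\0&0&i&0\end{pmatrix}, \quad \sigma_z = \begin{pmatrix}1&0&0&0\\0&-1&0&0\\0&0&1&0\\0&0&0&-1\end{pmatrix} \tag{1}$$

and

$$\rho_1 = \begin{pmatrix}0&0&1&0\\0&0&0&1\\1&0&0&0\\0&1&0&0\end{pmatrix}, \quad \rho_3 = \begin{pmatrix}1&0&0&0\\0&1&0&0\\0&0&-1&0\\0&0&0&-1\end{pmatrix}, \tag{2}$$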


and note that the square of each $\sigma$ and each $\rho$ is unity, that the $\sigma$'s anticommute with each other, that the $\rho$'s anticommute with each other, and that each $\sigma$ commutes with each $\rho$. Our $\rho_3$ equals the $\alpha_m$ of 10⁹¹; the relabeling of $\alpha_m$ is due partly to the convenience of being able to say "the $\rho$'s commute with the $\sigma$'s" rather than "$\rho_1$ and $\alpha_m$ commute with the $\sigma$'s." Dirac's theory employs besides $\rho_1$ and $\rho_3$ a matrix labeled $\rho_2$, but we shall not need this matrix here.

Now, the $\sigma$'s, the $\rho$'s, and the $\alpha$'s of 10⁹¹ satisfy the equations

$$\alpha_x = \rho_1\sigma_x, \quad \alpha_y = \rho_1\sigma_y, \quad \alpha_z = \rho_1\sigma_z, \quad \alpha_m = \rho_3; \tag{3}$$

and consequently the Hamiltonian 8⁹¹ can be written as

$$H = V - c\rho_1(\sigma_xp_x + \sigma_yp_y + \sigma_zp_z) - \rho_3mc^2. \tag{4}$$

The Dirac $\sigma$'s (1) are just like the Pauli $\sigma$'s 12®® except for a doubling; their physical significance will be demonstrated in the next section.
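The algebraic statements of this section are easy to check by machine. The following is only a sketch, not part of the original text: it assumes NumPy, builds the matrices (1) and (2) as Kronecker products, and treats the momentum components as ordinary numbers (legitimate here, since the matrices commute with the $p$'s). The names `algebra_holds` and `hamiltonian` are ours.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
I4 = np.eye(4, dtype=complex)
PAULI = {
    "x": np.array([[0, 1], [1, 0]], dtype=complex),
    "y": np.array([[0, -1j], [1j, 0]], dtype=complex),
    "z": np.array([[1, 0], [0, -1]], dtype=complex),
}

# Dirac sigma's (1): each Pauli matrix repeated in both diagonal 2-by-2 blocks.
SIGMA = {a: np.kron(I2, s) for a, s in PAULI.items()}
# rho_1 interchanges the upper and lower pairs; rho_3 = diag(1, 1, -1, -1).
RHO1 = np.kron(PAULI["x"], I2)
RHO3 = np.kron(PAULI["z"], I2)
# Equations (3): alpha_x = rho_1 sigma_x, and so on; alpha_m = rho_3.
ALPHA = {a: RHO1 @ SIGMA[a] for a in "xyz"}
ALPHA["m"] = RHO3


def commutator(a, b):
    return a @ b - b @ a


def anticommutator(a, b):
    return a @ b + b @ a


def algebra_holds():
    """Check the properties of the sigma's, rho's, and alpha's stated in the text."""
    squares = all(np.allclose(m @ m, I4) for m in [*SIGMA.values(), RHO1, RHO3])
    sigma_anti = all(np.allclose(anticommutator(SIGMA[a], SIGMA[b]), 0)
                     for a, b in ("xy", "yz", "zx"))
    rho_anti = np.allclose(anticommutator(RHO1, RHO3), 0)
    mixed = all(np.allclose(commutator(s, r), 0)
                for s in SIGMA.values() for r in (RHO1, RHO3))
    keys = list(ALPHA)
    alpha_anti = all(np.allclose(anticommutator(ALPHA[a], ALPHA[b]), 0)
                     for i, a in enumerate(keys) for b in keys[i + 1:])
    return squares and sigma_anti and rho_anti and mixed and alpha_anti


def hamiltonian(p, V=0.0, m=1.0, c=1.0):
    """Matrix of (4) with the components of p taken as numbers."""
    px, py, pz = p
    sigma_p = px * SIGMA["x"] + py * SIGMA["y"] + pz * SIGMA["z"]
    return V * I4 - c * RHO1 @ sigma_p - m * c**2 * RHO3
```

For example, `hamiltonian((0.3, -0.2, 0.5), V=0.1)` has the first row $V - mc^2$, $0$, $-cp_z$, $-c(p_x - ip_y)$ of the explicit matrix form of the Dirac equation, and for $V = 0$ its square is $(c^2p^2 + m^2c^4)$ times the unit matrix.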





93. The Existence of a ‘Spin’ Angular Momentum 

To exhibit a remarkable feature of the Dirac equation, we turn to the case of a central electrostatic field, when the Dirac Hamiltonian 4⁹² becomes

$$H = V(r) - c\rho_1(\sigma_xp_x + \sigma_yp_y + \sigma_zp_z) - \rho_3mc^2, \tag{1}$$

and look for constants of motion.

Classically, $L_x$, the $x$-component of the orbital angular momentum, is a constant of motion in a central field. To check whether this is so also in the Dirac theory, we recall the equation of motion 16” and compute the commutator of the Dirac operator $L_x$ and the Dirac $H$. Since $L_x$ commutes with every quantity in (1) except $p_y$ and $p_z$, we have

$$\begin{aligned}
L_xH - HL_x &= -c\rho_1[\sigma_y(L_xp_y - p_yL_x) + \sigma_z(L_xp_z - p_zL_x)]\\
&= -c\rho_1[\sigma_y(i\hbar p_z) + \sigma_z(-i\hbar p_y)]\\
&= -i\hbar c\rho_1(\sigma_yp_z - \sigma_zp_y) \neq 0.
\end{aligned} \tag{2}$$

Hence, in the Dirac theory, in contradistinction to classical mechanics, the $x$-component of the orbital angular momentum of an electron moving in a central electrostatic field is not a constant of motion.

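But we note also that, since $\sigma_x$ commutes with every quantity in (1) except $\sigma_y$ and $\sigma_z$, we have

$$\begin{aligned}
\sigma_xH - H\sigma_x &= -c\rho_1[(\sigma_x\sigma_y - \sigma_y\sigma_x)p_y + (\sigma_x\sigma_z - \sigma_z\sigma_x)p_z]\\
&= -c\rho_1[(2i\sigma_z)p_y + (-2i\sigma_y)p_z]\\
&= 2ic\rho_1(\sigma_yp_z - \sigma_zp_y).
\end{aligned} \tag{3}$$

Thus $L_xH - HL_x = -\tfrac12\hbar(\sigma_xH - H\sigma_x)$ and

$$(L_x + \tfrac12\hbar\sigma_x)H - H(L_x + \tfrac12\hbar\sigma_x) = 0, \tag{4}$$

so that the dynamical variable $L_x + \tfrac12\hbar\sigma_x$ is a constant of motion in a central field.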
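The matrix part of this argument, Equation (3), can be confirmed numerically. The sketch below is not part of the original text: it assumes NumPy, takes the Dirac $\sigma$'s and $\rho$'s as Kronecker products, and replaces $p_x$, $p_y$, $p_z$ by arbitrary numbers, which is permissible because the $\sigma$'s commute with the $p$'s. The function name `sigma_x_commutator_check` is ours.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
PX = np.array([[0, 1], [1, 0]], dtype=complex)
PY = np.array([[0, -1j], [1j, 0]], dtype=complex)
PZ = np.array([[1, 0], [0, -1]], dtype=complex)

# Dirac sigma's and rho's in the doubled (4-by-4) form.
SX, SY, SZ = (np.kron(I2, s) for s in (PX, PY, PZ))
RHO1 = np.kron(PX, I2)
RHO3 = np.kron(PZ, I2)


def sigma_x_commutator_check(p, V=0.0, m=1.0, c=1.0):
    """Return (lhs, rhs) of (3): sigma_x H - H sigma_x and 2ic rho_1 (sigma_y p_z - sigma_z p_y)."""
    px, py, pz = p
    H = (V * np.eye(4) - c * RHO1 @ (px * SX + py * SY + pz * SZ)
         - m * c**2 * RHO3)
    lhs = SX @ H - H @ SX
    rhs = 2j * c * RHO1 @ (SY * pz - SZ * py)
    return lhs, rhs


lhs, rhs = sigma_x_commutator_check((0.7, -1.1, 0.4), V=0.3, m=2.0, c=1.5)
```

Multiplying `lhs` by $-\tfrac12\hbar$ then reproduces the last line of (2), which is the cancellation expressed by (4).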

The quantity $\tfrac12\hbar\sigma_x$, which must be added to $L_x$ before we get a constant of motion, has its origin in the relativistic considerations that led us to adopt the Dirac Hamiltonian 9⁹¹ as the correct Hamiltonian, and therefore represents a deeply rooted relativistic effect. This quantity has essentially the same form as that associated in the Pauli theory with the $x$-component of the spin angular momentum of the electron and, to put the matter roughly, is in large part responsible for the fact that Dirac's theory accounts for the phenomena that have been previously attributed to a rotary motion of the electron. For these reasons it is called the $x$-component of the spin angular momentum of the electron although, since the question of the rotation of the electron does not arise at all in Dirac's theory, this name is somewhat misleading.

We introduce the notation

(2 ) $$s_x = \tfrac12\hbar\sigma_x, \quad s_y = \tfrac12\hbar\sigma_y, \quad s_z = \tfrac12\hbar\sigma_z, \tag{5}$$

where the $\sigma$'s are Dirac's, and also the notation

(39 ) $$J_x = L_x + s_x, \quad J_y = L_y + s_y, \quad J_z = L_z + s_z. \tag{6}$$

The quantity $J_x$, which we call the $x$-component of the total angular momentum of the electron, was shown above to be a constant of motion in central fields. Similar arguments show that $J_y$ and $J_z$, and with them the quantity $J^2 = J_x^2 + J_y^2 + J_z^2$, are also constants of motion.

A more advanced argument shows that according to Dirac's equation an electron behaves in a magnetic field as though it possesses a 'spin' magnetic moment $\boldsymbol\mu$ related to the spin angular momentum $\mathbf s$ through the equation $\boldsymbol\mu = -e\mathbf s/mc$, that is, the equation postulated by Uhlenbeck and Goudsmit. In short, Dirac's equation automatically endows the electron with the properties that account for the phenomena previously ascribed to a hypothetical spinning motion of the electron.


94. Constants of Motion for a Central Field 


Dirac's equation, that is, the system of the four equations 20⁹¹, is not easy to handle directly even if $V = V(r)$, and in treating central motion we shall therefore once again have recourse to the expansion theorem. Our first task is to identify a suitable set of commuting constants of motion for the case when $H$ is given by 1⁹³.

In §93 the quantities $J_x$, $J_y$, $J_z$, and $J^2$ were found to be constants of motion for central fields; $J_x$, $J_y$, and $J_z$ do not commute with one another, but each commutes with $J^2$, so that we may adopt the set

$$J_z = L_z + s_z, \qquad J^2 = (\mathbf L + \mathbf s)^2 = L^2 + 2\mathbf L\cdot\mathbf s + \tfrac34\hbar^2. \tag{1}$$


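Written out more explicitly, the operators are

$$J_z = \begin{pmatrix} L_z + \tfrac12\hbar & 0 & 0 & 0\\ 0 & L_z - \tfrac12\hbar & 0 & 0\\ 0 & 0 & L_z + \tfrac12\hbar & 0\\ 0 & 0 & 0 & L_z - \tfrac12\hbar \end{pmatrix} \tag{2}$$

and

$$J^2 = \begin{pmatrix} L^2 + \hbar L_z + \tfrac34\hbar^2 & \hbar(L_x - iL_y) & 0 & 0\\ \hbar(L_x + iL_y) & L^2 - \hbar L_z + \tfrac34\hbar^2 & 0 & 0\\ 0 & 0 & L^2 + \hbar L_z + \tfrac34\hbar^2 & \hbar(L_x - iL_y)\\ 0 & 0 & \hbar(L_x + iL_y) & L^2 - \hbar L_z + \tfrac34\hbar^2 \end{pmatrix}. \tag{3}$$

Comparison with 41®® and 42®® shows that Dirac's $J_z$ and $J^2$ are just like Pauli's except for a doubling.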

On account of the way in which the $p$'s appear in the Hamiltonian 1⁹³, the operator $L^2$ does not commute with $H$, so that the magnitude of the orbital angular momentum is not a constant of motion in Dirac's theory of central fields. This implies, incidentally, that in Dirac's theory it is not quite correct to picture the vectors $\mathbf L$ and $\mathbf s$ as simply precessing about the vector $\mathbf J$ as shown in Fig. 110®’^.

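The loss of $L^2$ as a constant of motion is fortunately offset by the fact that the quantity

$$K = \frac{1}{\hbar}\,\rho_3(\sigma_xL_x + \sigma_yL_y + \sigma_zL_z + \hbar), \tag{4}$$

that is,

$$K = \frac{1}{\hbar}\begin{pmatrix} L_z + \hbar & L_x - iL_y & 0 & 0\\ L_x + iL_y & -L_z + \hbar & 0 & 0\\ 0 & 0 & -L_z - \hbar & -L_x + iL_y\\ 0 & 0 & -L_x - iL_y & L_z - \hbar \end{pmatrix}, \tag{5}$$

constructed by Dirac, commutes with $H$ and also with $J_z$ and $J^2$.

In central fields we thus have the commuting constants of motion

$$J_z, \quad J^2, \quad K, \tag{6}$$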



so that in computing the eigenvalues of $H$ we may, according to the expansion theorem, restrict ourselves to those eigen-$\psi$'s of $H$ that are simultaneously eigen-$\psi$'s of $J_z$, $J^2$, and $K$.

The simultaneous eigen-$\psi$'s of $J_z$, $J^2$, and $K$. In this computation we shall make use of the fact that we already know the Pauli eigen-$\psi$'s of $J_z$ and $J^2$; a procedure not involving the Pauli $\psi$'s is outlined in Exercise 5.

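If we set

$$\begin{aligned}
f_1 &= \sqrt{j+m}\;Y(j-\tfrac12,\,m-\tfrac12), & g_1 &= \sqrt{j+1-m}\;Y(j+\tfrac12,\,m-\tfrac12),\\
f_2 &= \sqrt{j-m}\;Y(j-\tfrac12,\,m+\tfrac12), & g_2 &= -\sqrt{j+1+m}\;Y(j+\tfrac12,\,m+\tfrac12),
\end{aligned} \tag{8}$$

the Pauli $\psi$'s 27®^ and 29*^ take the form

$$\psi = R\begin{pmatrix}f_1\\ f_2\end{pmatrix}, \qquad \psi = R\begin{pmatrix}g_1\\ g_2\end{pmatrix}, \tag{9}$$

where $R$ stands for a radial factor. The values of $j$ are, as before,

$$j = \tfrac12, \tfrac32, \tfrac52, \ldots. \tag{10}$$

Note that we now place the emphasis on $j$ rather than on $l$. The $\psi$'s (9) and their linear combinations are simultaneous Pauli eigen-$\psi$'s of $J_z$ and $J^2$ belonging to the respective eigenvalues $\hbar m$ of $J_z$ and $j(j+1)\hbar^2$ of $J^2$. Now, Dirac's operators $J_z$ and $J^2$ are identical with Pauli's except for a doubling, and consequently the four-rowed $\psi$'s

$$\psi_A = R\begin{pmatrix}f_1\\ f_2\\ 0\\ 0\end{pmatrix}, \quad \psi_B = R\begin{pmatrix}g_1\\ g_2\\ 0\\ 0\end{pmatrix}, \quad \psi_C = R\begin{pmatrix}0\\ 0\\ f_1\\ f_2\end{pmatrix}, \quad \psi_D = R\begin{pmatrix}0\\ 0\\ g_1\\ g_2\end{pmatrix} \tag{11}$$

and their linear combinations are simultaneous eigen-$\psi$'s of Dirac's $J_z$ and $J^2$ belonging to the respective eigenvalues $\hbar m$ of $J_z$ and $j(j+1)\hbar^2$ of $J^2$.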

Operating on each of these $\psi$'s in turn with the operator $K$ and using Equations 12^ to 14^, we find that

$$K\psi_A = (j+\tfrac12)\psi_A, \quad K\psi_B = -(j+\tfrac12)\psi_B, \quad K\psi_C = -(j+\tfrac12)\psi_C, \quad K\psi_D = (j+\tfrac12)\psi_D, \tag{12}$$

so that each of these $\psi$'s is an eigen-$\psi$ of $K$; in particular, it follows from (12) and (10) that the eigenvalues of $K$, which we denote by $k$, are

$$k = \pm1, \pm2, \pm3, \ldots. \tag{13}$$
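A finite-dimensional check of these statements is possible if the orbital angular momentum is restricted to a single value of $l$. The sketch below is not part of the original text: it assumes NumPy, takes $l = 1$ with the standard 3-by-3 matrices for $L_x$, $L_y$, $L_z$, sets $\hbar = 1$, and forms $K = \rho_3(\boldsymbol\sigma\cdot\mathbf L + \hbar)/\hbar$ on the 12-dimensional product space. It cannot test the commutation of $K$ with $H$, which involves the $p$'s.

```python
import numpy as np

HBAR = 1.0
I2, I3, I4 = (np.eye(n, dtype=complex) for n in (2, 3, 4))
PAULI = {
    "x": np.array([[0, 1], [1, 0]], dtype=complex),
    "y": np.array([[0, -1j], [1j, 0]], dtype=complex),
    "z": np.array([[1, 0], [0, -1]], dtype=complex),
}
# Orbital angular momentum for l = 1 in the basis m = 1, 0, -1.
r = 1 / np.sqrt(2)
L = {
    "x": HBAR * r * np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=complex),
    "y": HBAR * r * np.array([[0, -1j, 0], [1j, 0, -1j], [0, 1j, 0]], dtype=complex),
    "z": HBAR * np.diag([1, 0, -1]).astype(complex),
}
SIGMA = {a: np.kron(I2, PAULI[a]) for a in "xyz"}   # Dirac sigma's
RHO3 = np.kron(PAULI["z"], I2)                      # diag(1, 1, -1, -1)

# Operators on the product space (orbital factor first, four-component factor second).
J = {a: np.kron(L[a], I4) + 0.5 * HBAR * np.kron(I3, SIGMA[a]) for a in "xyz"}
J2 = sum(J[a] @ J[a] for a in "xyz")
sigma_dot_L = sum(np.kron(L[a], SIGMA[a]) for a in "xyz")
K = np.kron(I3, RHO3) @ (sigma_dot_L + HBAR * np.eye(12)) / HBAR


def commutator(a, b):
    return a @ b - b @ a


# K is Hermitian here, so eigvalsh applies; round to the nearest integers.
k_values = sorted({int(np.rint(v)) for v in np.linalg.eigvalsh(K)})
```

With $l = 1$ the values of $j$ are $\tfrac12$ and $\tfrac32$, and `k_values` comes out as $-2, -1, 1, 2$, in line with (12) and (13); the same matrices also satisfy $\hbar^2K^2 = J^2 + \tfrac14\hbar^2$.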


Since the four $\psi$'s (11) do not all belong to the same eigenvalue of $K$, not all of their linear combinations are eigen-$\psi$'s of $K$. But $\psi_A$ and $\psi_D$ do belong to the same eigenvalue of $K$, and therefore all of their nonvanishing linear combinations, which we denote by $\psi(j, k = j + \tfrac12, m)$ and write as

$$\psi(j,\,k = j+\tfrac12,\,m) = \begin{pmatrix}R^af_1\\ R^af_2\\ R^bg_1\\ R^bg_2\end{pmatrix}, \tag{14}$$

are eigen-$\psi$'s of $K$. The radial factors $R^a$ and $R^b$ in (14) are arbitrary and independent of each other, except that at most one of them may be identically zero. Similarly, $\psi_B$ and $\psi_C$ belong to the same eigenvalue of $K$, and therefore all of their nonvanishing linear combinations, which we denote by $\psi(j, k = -j - \tfrac12, m)$ and write as

$$\psi(j,\,k = -j-\tfrac12,\,m) = \begin{pmatrix}R^ag_1\\ R^ag_2\\ R^bf_1\\ R^bf_2\end{pmatrix}, \tag{15}$$

are also eigen-$\psi$'s of $K$. The $R$'s in (15) are of course unrelated to those in (14).

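Written out in greater detail, the simultaneous eigen-$\psi$'s of $J_z$, $J^2$, and $K$ are

$$\psi(j,\,k = j+\tfrac12,\,m) = \begin{pmatrix} \sqrt{j+m}\;R^a\,Y(j-\tfrac12,\,m-\tfrac12)\\ \sqrt{j-m}\;R^a\,Y(j-\tfrac12,\,m+\tfrac12)\\ \sqrt{j+1-m}\;R^b\,Y(j+\tfrac12,\,m-\tfrac12)\\ -\sqrt{j+1+m}\;R^b\,Y(j+\tfrac12,\,m+\tfrac12) \end{pmatrix} \tag{16}$$

and

$$\psi(j,\,k = -j-\tfrac12,\,m) = \begin{pmatrix} \sqrt{j+1-m}\;R^a\,Y(j+\tfrac12,\,m-\tfrac12)\\ -\sqrt{j+1+m}\;R^a\,Y(j+\tfrac12,\,m+\tfrac12)\\ \sqrt{j+m}\;R^b\,Y(j-\tfrac12,\,m-\tfrac12)\\ \sqrt{j-m}\;R^b\,Y(j-\tfrac12,\,m+\tfrac12) \end{pmatrix}. \tag{17}$$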





Exercises 

1. Show that $K$ commutes with $J_z$, $J^2$, and with the Hamiltonian 1⁹³.

2. Verify Equations (12).

3. Show that

$$\hbar^2K^2 = J^2 + \tfrac14\hbar^2 \tag{18}$$

and that consequently every eigen-$\psi$ of $K$ belonging to the eigenvalue $k$ of $K$ is automatically an eigen-$\psi$ of $J^2$ belonging to the eigenvalue $(k - \tfrac12)(k + \tfrac12)\hbar^2$ of $J^2$. Note that hence the set of the two commuting constants of motion $J_z$ and $K$ is equivalent to the set of the three constants used in the text.

4. Use (10) and (18) to show that (13) includes all eigenvalues of $K$.

5. Compute the $\psi$'s (16) and (17), without using the Pauli $\psi$'s, as follows: (a) Require a $\psi$ with components $\psi_1$, $\psi_2$, $\psi_3$, and $\psi_4$ to be an eigen-$\psi$ of $J_z$ belonging to the eigenvalue $m\hbar$ of $J_z$, with $m$ as yet undetermined, and thus find the dependence of $\psi_1$, $\psi_2$, $\psi_3$, and $\psi_4$ on the angle $\varphi$. (b) Require the eigen-$\psi$ of $J_z$ to be an eigen-$\psi$ of $J^2 - \hbar^2K$ belonging to the eigenvalue $\lambda'$ of $J^2 - \hbar^2K$ and, restricting yourself to the components $\psi_1$ and $\psi_2$, determine their dependence on the angle $\theta$. (c) Require the eigen-$\psi$ of $J_z$ to be an eigen-$\psi$ of $J^2 + \hbar^2K$ belonging to the eigenvalue $\lambda''$ of $J^2 + \hbar^2K$ and, restricting yourself to the components $\psi_3$ and $\psi_4$, determine their dependence on $\theta$. (d) Write down the simultaneous eigen-$\psi$'s of $J_z$, $J^2 - \hbar^2K$, and $J^2 + \hbar^2K$ and require them to be eigen-$\psi$'s of $K$ belonging to the eigenvalue $k$ of $K$.

6. Show that $j$ can be eliminated from (16) and (17) with the help of the quantities $k$, $|k|$, and $k/|k|$ in such a way that (16) and (17) become identical in form.

95. The Radial Equations

Having computed the simultaneous eigen-$\psi$'s of $J_z$, $J^2$, and $K$, we proceed to consider the simultaneous eigen-$\psi$'s of $J_z$, $J^2$, $K$, and $H$ for a central field. Our problem is to find the conditions that the Dirac equation, taken in the form 19⁹¹, with the understanding that $V = V(r)$, imposes on the four components of $\psi$ after these components have already been partly determined by the requirement that $\psi$ be an eigen-$\psi$ of $J_z$, $J^2$, and $K$, that is, that $\psi$ have the form 16⁹⁴ or the form 17⁹⁴.

We begin with $\psi(j, k = j + \tfrac12, m)$, substitute $\psi_1$, $\psi_2$, $\psi_3$, and $\psi_4$ from 16⁹⁴ into the first of the four equations 19⁹¹, and get the equation

$$\begin{aligned}
(V - mc^2)\sqrt{j+m}\;R^a\,Y(j-\tfrac12,\,m-\tfrac12) &- cp_z\sqrt{j+1-m}\;R^b\,Y(j+\tfrac12,\,m-\tfrac12)\\
&+ c(p_x - ip_y)\sqrt{j+1+m}\;R^b\,Y(j+\tfrac12,\,m+\tfrac12)\\
&= E\sqrt{j+m}\;R^a\,Y(j-\tfrac12,\,m-\tfrac12),
\end{aligned} \tag{1}$$



which in view of 17^ and 18^ reduces to

$$(E - V + mc^2)R^a - ic\hbar\sqrt{\frac{j+1}{j}}\left[\frac{dR^b}{dr} + \left(j + \tfrac32\right)\frac{R^b}{r}\right] = 0, \tag{2}$$

or, since in the present case $k = j + \tfrac12$, to

$$(E - V + mc^2)R^a - ic\hbar\sqrt{\frac{2k+1}{2k-1}}\left[\frac{dR^b}{dr} + (k+1)\frac{R^b}{r}\right] = 0. \tag{3}$$

The second of the four equations 19⁹¹ leads once again to (3), while the third and fourth each yield the same new equation

$$(E - V - mc^2)R^b - ic\hbar\sqrt{\frac{2k-1}{2k+1}}\left[\frac{dR^a}{dr} - (k-1)\frac{R^a}{r}\right] = 0. \tag{4}$$

Turning to the $\psi(j, k = -j - \tfrac12, m)$ given by 17⁹⁴, we find that the first of the four equations 19⁹¹ yields an equation slightly different from (2); but when we eliminate $j$ from this equation by means of $k$, which now equals $-j - \tfrac12$, we get (3) once again. Similarly, the second of the equations 19⁹¹ also yields (3), while the third and fourth each yield (4). The fact that $\psi(j, k = j + \tfrac12, m)$ and $\psi(j, k = -j - \tfrac12, m)$ give identical results when these results are expressed in terms of $k$ rather than $j$ is not unexpected in view of the conclusion of Exercise 6.

The substitution

$$R_a = R^a, \qquad R_b = i\sqrt{\frac{2k+1}{2k-1}}\;R^b \tag{5}$$

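reduces (3) and (4) to the equations

$$\begin{aligned}
\frac{1}{c\hbar}[E - V(r) + mc^2]R_a - \frac{dR_b}{dr} - (k+1)\frac{R_b}{r} &= 0\\
\frac{1}{c\hbar}[E - V(r) - mc^2]R_b + \frac{dR_a}{dr} - (k-1)\frac{R_a}{r} &= 0.
\end{aligned} \tag{6}$$

Since these equations incorporate both of the special cases $\psi(j, k = j + \tfrac12, m)$ and $\psi(j, k = -j - \tfrac12, m)$, the quantum number $k$ in (6) runs through its complete range:

(13⁹⁴) $$k = \pm1, \pm2, \pm3, \ldots. \tag{7}$$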

We now conclude that the problem of computing the energy levels of an electron moving in a central electrostatic field reduces in the Dirac theory to the problem of computing the numerical values of the parameter $E$ for which the system of the two simultaneous first-order ordinary differential equations (6) has well-behaved solutions $R_a$ and $R_b$. These equations, called the radial equations, must be studied separately for each particular form of $V(r)$; an example is given in the next section.

The substitutions

(3”) $$\chi_a = rR_a, \qquad \chi_b = rR_b \tag{8}$$

reduce (6) to the more convenient form

$$\begin{aligned}
\frac{1}{c\hbar}[E - V(r) + mc^2]\chi_a - \frac{d\chi_b}{dr} - \frac{k}{r}\chi_b &= 0\\
\frac{1}{c\hbar}[E - V(r) - mc^2]\chi_b + \frac{d\chi_a}{dr} - \frac{k}{r}\chi_a &= 0.
\end{aligned} \tag{9}$$

A method for deriving the radial equations without the use of constants of motion was given by Darwin;² a symbolic method for deriving the $\chi$-equations (9) was given by Dirac.³

Exercises 

1. Eliminate $R_a$ between the two equations (6) and show that $R_b$, which we shall denote simply by $R$, satisfies the equation

i » - F - - r + (I + ^)« - 0. 

( 10 ) 

where $V = V(r)$.

2. Show that, with the help of 24^ and 2®*, Equation (10) can be 
written as 

2m(?(R - 7 + P’r + j R+VR 

- 7 + jnc*)-* j^(fc + l){(r)R + rt(r) ^ j 

= (jr - md‘)R. (11) 

² C. G. Darwin, Proc. Roy. Soc., 118, 654 (1928).

³ Dirac, §73.





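96. The Energy Levels of the Hydrogen Atom*

We now consider along the lines of the Dirac theory an electron moving in the fixed Coulomb potential $e/r$; the potential energy of the electron then is

(1“) $$V(r) = -\frac{e^2}{r}, \tag{1}$$

and Dirac's $\chi$-equations 9⁹⁵ become

$$\begin{aligned}
\frac{1}{c\hbar}\left[E + \frac{e^2}{r} + mc^2\right]\chi_a - \frac{d\chi_b}{dr} - \frac{k}{r}\chi_b &= 0\\
\frac{1}{c\hbar}\left[E + \frac{e^2}{r} - mc^2\right]\chi_b + \frac{d\chi_a}{dr} - \frac{k}{r}\chi_a &= 0.
\end{aligned} \tag{2}$$

This problem is one of the few that can be solved precisely.

Using primes to denote differentiation with respect to $r$ and introducing the abbreviations

$$A = (mc^2 + E)/c\hbar, \qquad B = (mc^2 - E)/c\hbar, \tag{3}$$

and the fine-structure constant

(6**) $$\alpha = e^2/\hbar c, \tag{4}$$

we rewrite (2) as

$$\begin{aligned}
A\chi_a + \alpha\chi_a/r - \chi_b' - k\chi_b/r &= 0\\
B\chi_b - \alpha\chi_b/r - \chi_a' + k\chi_a/r &= 0.
\end{aligned} \tag{5}$$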

Next we write the functions $\chi_a$ and $\chi_b$ in the forms

$$\chi_a = e^{-Cr}\sum_s a_sr^s, \qquad s = \tau, \tau+1, \tau+2, \ldots, \tag{6}$$

$$\chi_b = e^{-Cr}\sum_s b_sr^s, \qquad s = \tau, \tau+1, \tau+2, \ldots. \tag{7}$$

The constant $C$ is as yet undetermined; the $a$'s and $b$'s are also undetermined, but to avoid trivialities we require that at least one of the two coefficients $a_\tau$ and $b_\tau$ be different from zero. Substituting (6) and (7) into (5), we get

$$\begin{aligned}
\sum_s\{[Aa_s + Cb_s]r^s + [\alpha a_s - (s+k)b_s]r^{s-1}\} &= 0\\
\sum_s\{[Bb_s + Ca_s]r^s - [\alpha b_s + (s-k)a_s]r^{s-1}\} &= 0.
\end{aligned} \tag{8}$$

* Dirac, §74. Just as in our previous computations involving relativity or spin, we make no correction for nuclear motion.

To determine $\tau$, that is, the exponent of $r$ with which the series in (6) and (7) begin, we equate to zero the coefficients of the lowest [the $(\tau - 1)$th] power of $r$ in (8); the resulting equations

$$\begin{aligned}
\alpha a_\tau - (\tau + k)b_\tau &= 0\\
\alpha b_\tau + (\tau - k)a_\tau &= 0
\end{aligned} \tag{9}$$

have nonvanishing solutions $a_\tau$ and $b_\tau$ if

$$\tau = \sqrt{k^2 - \alpha^2} \tag{10}$$

and also if $\tau$ is the negative of (10); the second possibility is of no interest to us, since it yields $\chi$'s that are ill-behaved at the origin.

We now proceed to correlate the $a$'s and the $b$'s in (6) and (7). Collecting the $(s - 1)$th powers of $r$ in (8) and equating the sums of their coefficients to zero, we get

$$Aa_{s-1} + Cb_{s-1} + \alpha a_s - (s+k)b_s = 0 \tag{11a}$$

$$Ca_{s-1} + Bb_{s-1} - (s-k)a_s - \alpha b_s = 0. \tag{11b}$$

Two of the four unknowns, namely, $a_{s-1}$ and $b_{s-1}$, can both be eliminated from Equations (11) provided we choose the as yet undetermined constant $C$ so as to make the determinant of the coefficients of $a_{s-1}$ and $b_{s-1}$ vanish; so we set

$$C = \sqrt{AB} = \sqrt{m^2c^4 - E^2}/\hbar c. \tag{12}$$

If we now multiply (11a) by $C$ and (11b) by $A$, and subtract, we find that the $a$'s and the $b$'s are related as follows:

$$[C\alpha + (s-k)A]a_s + [A\alpha - (s+k)C]b_s = 0. \tag{13}$$

Next we consider the behavior of the $a$'s and $b$'s for large values of $s$, when (13) reduces to the approximate equation

$$Aa_s \cong Cb_s. \tag{14}$$



Eliminating the $b$'s from (11b) with the help of (14) and the equation $Aa_{s-1} \cong Cb_{s-1}$ which follows from (14), we get

$$[C + AB/C]a_{s-1} - [(s-k) + A\alpha/C]a_s \cong 0, \tag{15}$$

or, since $s$ is large, $[C + AB/C]a_{s-1} - sa_s \cong 0$, that is,

$$a_s/a_{s-1} \cong 2C/s. \tag{16}$$

Eliminating the $a$'s from (16) with the help of (14), we find that, for large values of $s$, $b_s/b_{s-1} \cong 2C/s$.

Since the right side of (16) is just the ratio of the coefficients of the successive powers of $r$ in the power series for $e^{2Cr}$, we conclude that for large values of $r$ the series in (6) and (7) behave like $e^{2Cr}$ unless they terminate, so that the $\chi$'s themselves behave like $e^{Cr}$. Now, according to (12), $C$ is positive if $E^2 < m^2c^4$ and imaginary if $E^2 > m^2c^4$. Hence our $\chi$'s are well-behaved when $r = \infty$ if $E^2 > m^2c^4$ and, unless the series in (6) and (7) terminate, are ill-behaved if $E^2 < m^2c^4$. We conclude: the electron in a Coulomb field may have any energy greater than $mc^2$ or smaller than $-mc^2$; but not all energies lying between $-mc^2$ and $mc^2$ are possible.

Finally we turn to the case when the series in (6) terminates, say with a term in $r^S$, so that $a_S \neq 0$ and $a_{S+1} = a_{S+2} = \cdots = 0$; according to (13), the series in (7) then also terminates with a term in $r^S$. Since, as we have seen, both series start with a term in $r^\tau$ and the exponents of $r$ increase by steps of unity, we have

$$S - \tau = \text{positive integer or zero}. \tag{17}$$

Introducing the integer $n$ having the range

$$n = |k|,\ |k|+1,\ |k|+2,\ \ldots, \tag{18}$$

we may write (17) as $S - \tau = n - |k|$, or, in view of (10), as

$$S = n - |k| + \sqrt{k^2 - \alpha^2}. \tag{19}$$

We may now use (11) and (13) to compute the value of $a_s/a_{s-1}$ without assuming $s$ to be large, and then find the condition under which $a_{S+1}/a_S$ is zero; but a simpler procedure is as follows: We write $S$ for $s - 1$ in (11a) and set $a_{S+1}$ and $b_{S+1}$ equal to zero, getting

$$Aa_S + Cb_S = 0. \tag{20}$$



Elimination of $b_S$ from (20) by means of (13) then yields the equation $2SAC = (A^2 - C^2)\alpha$, which, in view of (3) and (12), reduces to

$$S\sqrt{m^2c^4 - E^2} = E\alpha. \tag{21}$$

Squaring both sides of (21) and recalling (19), we get the equation

$$\left(n - |k| + \sqrt{k^2 - \alpha^2}\right)^2(m^2c^4 - E^2) = \alpha^2E^2, \tag{22}$$

which has the positive solution

$$E = mc^2\left[1 + \frac{\alpha^2}{\left(n - |k| + \sqrt{k^2 - \alpha^2}\right)^2}\right]^{-1/2} \tag{23}$$

and a second solution that is just the negative of (23); the negative solution, however, does not satisfy (21), and is therefore of no interest to us. The ranges of $k$ and $n$ in (23), given by 7⁹⁵ and (18), can be reindicated as follows:

$$n = 1, 2, 3, \ldots, \tag{24}$$

$$k = \pm1, \pm2, \ldots, \pm(n-1), -n. \tag{25}$$

The reason for omitting the value $k = n$ in (25) is taken up in Exercise 3.

We note that, except for the duplicity of the levels for which $|k| < n$, the formula (23) is identical with the Sommerfeld fine-structure formula 3*^.
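To get a feeling for the numbers contained in (23) to (25), one may evaluate the formula directly. The sketch below is not part of the original text: the numerical values of $\alpha$ and of $mc^2$ are assumptions inserted for illustration, and the function names are ours.

```python
import math

ALPHA = 1 / 137.036   # fine-structure constant (assumed numerical value)
MC2_EV = 510998.95    # electron rest energy mc^2 in electron volts (assumed value)


def dirac_level(n, k, alpha=ALPHA):
    """E/mc^2 from formula (23) for quantum numbers n and k in the ranges (24) and (25)."""
    if n < 1 or k == 0 or abs(k) > n or k == n:
        raise ValueError("n and k must satisfy (24) and (25)")
    S = n - abs(k) + math.sqrt(k * k - alpha * alpha)   # Equation (19)
    return 1.0 / math.sqrt(1.0 + (alpha / S) ** 2)


def binding_energy_ev(n, k):
    """Distance of the level below mc^2, in electron volts."""
    return (1.0 - dirac_level(n, k)) * MC2_EV


ground = binding_energy_ev(1, -1)                       # normal state
split_n2 = (dirac_level(2, -2) - dirac_level(2, -1)) * MC2_EV
```

With these inputs `ground` is close to 13.6 electron volts, the levels $n = 2$, $k = 1$ and $n = 2$, $k = -1$ coincide, and the level $n = 2$, $k = -2$ lies above them by `split_n2`, a few hundred-thousandths of an electron volt.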

Besides the continuous energy spectrum extending from $mc^2$ to $\infty$ and the discrete spectrum described by Equations (23) to (25) and lying just below $mc^2$,⁵ the Dirac electron in a Coulomb field has, as we found on the preceding page, also a continuous energy spectrum extending from $-mc^2$ to $-\infty$. It is a characteristic of the Dirac theory that, in addition to levels for which the kinetic energy of the electron is positive, it yields levels of negative kinetic energy; to put it roughly, the mathematical reason for this is that, since the algebraic signs of the $\alpha$'s in 9⁹¹ are immaterial, the ambiguous sign of the relativistic square root is implicitly contained in Dirac's Hamiltonian.

In classical relativity the states of negative kinetic energy can be disregarded altogether. But according to the Dirac theory the probabilities for quantum jumps from states of positive to states of negative kinetic energy do not all vanish, and hence the latter states cannot be disregarded without introducing inconsistencies into the theory; further, certain mathematical expansions are impossible if the $\psi$'s describing states of negative kinetic energy are left out.

These difficulties were reduced by Dirac through the hypothesis, which we state very roughly, that the states of negative kinetic energy are already filled with electrons, that transitions from states of positive to states of negative kinetic energy are therefore impossible, and that electrons having negative kinetic energies (and correspondingly curious properties) are distributed throughout space with such a high and uniform density that we are unaware of their existence; this hypothesis carried with it the theoretical implication that under certain circumstances there should come into being particles having the mass of the electron but the opposite charge: anti-electrons, as Dirac then called them. This forecast was verified a few years later when Anderson discovered experimentally the existence of such particles, now called positrons.

⁵ The value of $mc^2$ is approximately half a million electron volts; the lowest level of the discrete spectrum lies about 13.5 electron volts (that is, the ionization potential of the hydrogen atom in the normal state) below $mc^2$.

Exercises 

1. Consider the consequences of reversing the algebraic sign of $C$ in (12).

2. Construct the respective second-order differential equations approximately satisfied separately by $\chi_a$ and $\chi_b$ in (5) for large values of $r$, and from their solutions determine the possible values of $C$ in (6) and (7).

3. Show that, if the series in (6) and (7) each contain only a single term, so that $S = \tau$, then (13) is automatically satisfied in view of (9) and is therefore useless for the computation of the levels; then assume from the outset that the series (6) and (7) each contain only a single term, write down the equations to which Equations (8) reduce in this case, and show that when $0 < E < mc^2$ they are consistent only if $k$ is negative; finally verify that the levels lying between 0 and $mc^2$ and corresponding to the single-term case are accounted for by (23) to (25).

4. Illustrate the Dirac classification of the hydrogen levels by a diagram patterned after Fig. 108**. Mark the values of $j$ for each level and note that, as in Fig. 112*®, the levels having the same $n$'s and the same $j$'s are coincident.

97. The Normal State of the Hydrogen Atom 

We shall now find explicitly the Dirac $\psi$'s for the normal state of the hydrogen atom. In this work it is convenient to use the abbreviations

$$\gamma = \sqrt{1 - \alpha^2}, \qquad a_0 = \hbar^2/me^2, \tag{1}$$

where $a_0$ is the Bohr radius introduced in 21®®.

The lowest energy given by the fine-structure formula 23⁹⁶ is

$$E = \gamma mc^2. \tag{2}$$


This energy corresponds to the values

$$n = 1, \qquad k = -1 \tag{3}$$

of the principal quantum number $n$ and the auxiliary quantum number $k$; note that, according to 25⁹⁶, $-1$ is the only value of $k$ consistent with $n = 1$. Equations 10⁹⁶ and 12⁹⁶ reduce in the present case to

$$\tau = \gamma, \qquad C = 1/a_0. \tag{4}$$

Since now $|k| = n$, the series 6⁹⁶ and 7⁹⁶ reduce to the single terms

$$\chi_a = a_\tau r^\gamma e^{-r/a_0}, \qquad \chi_b = b_\tau r^\gamma e^{-r/a_0}. \tag{5}$$

Substituting (5) into 2⁹⁶, we find that the $\chi$-equations are satisfied provided

$$a_\tau/b_\tau = -\alpha/(1 + \gamma). \tag{6}$$

Hence, apart from a common arbitrary multiplicative constant, our $\chi$'s are

$$\chi_a = \alpha r^\gamma e^{-r/a_0}, \qquad \chi_b = -(1 + \gamma)r^\gamma e^{-r/a_0}, \tag{7}$$

where $\gamma$ and $a_0$ are defined in (1). Note that, since $\alpha$ is approximately $\tfrac{1}{137}$ and hence $\gamma$ is only a little less than 1, the absolute value of $\chi_a$ is much smaller than that of $\chi_b$.

Equation 8⁹⁵ now yields

$$R_a = \alpha r^{\gamma-1}e^{-r/a_0}, \qquad R_b = -(1 + \gamma)r^{\gamma-1}e^{-r/a_0}. \tag{8}$$

Since $\gamma$ is a little less than 1, both $R_a$ and $R_b$ have mild infinities at $r = 0$; they are nevertheless well-behaved.⁶
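That the $\chi$'s (7) really satisfy the Coulomb $\chi$-equations, in the form $A\chi_a + \alpha\chi_a/r - \chi_b' - k\chi_b/r = 0$ and $B\chi_b - \alpha\chi_b/r - \chi_a' + k\chi_a/r = 0$ of §96, can be tested numerically. The sketch below is not part of the original text: it assumes NumPy, works in units $\hbar = c = m = 1$ (so that $a_0 = 1/\alpha$, $A = 1 + E$, $B = 1 - E$), uses an assumed numerical value of $\alpha$, and estimates derivatives by central differences.

```python
import numpy as np

ALPHA = 1 / 137.036              # fine-structure constant (assumed numerical value)
GAMMA = np.sqrt(1 - ALPHA**2)    # gamma of (1)
A0 = 1 / ALPHA                   # Bohr radius in units hbar = c = m = 1
E = GAMMA                        # normal-state energy (2) in units of mc^2
A, B = 1 + E, 1 - E              # abbreviations (mc^2 + E)/c hbar and (mc^2 - E)/c hbar
K = -1                           # auxiliary quantum number of the normal state


def chi_a(r):
    return ALPHA * r**GAMMA * np.exp(-r / A0)


def chi_b(r):
    return -(1 + GAMMA) * r**GAMMA * np.exp(-r / A0)


def derivative(f, r, h):
    """Central-difference estimate of f'(r)."""
    return (f(r + h) - f(r - h)) / (2 * h)


def residuals(r, h=1e-4 * A0):
    """Left sides of the two chi-equations, relative to the scale |chi_b|/a_0."""
    da, db = derivative(chi_a, r, h), derivative(chi_b, r, h)
    res1 = A * chi_a(r) + ALPHA * chi_a(r) / r - db - K * chi_b(r) / r
    res2 = B * chi_b(r) - ALPHA * chi_b(r) / r - da + K * chi_a(r) / r
    scale = np.abs(chi_b(r)) / A0
    return np.abs(res1) / scale, np.abs(res2) / scale


r = np.linspace(0.5, 5.0, 50) * A0
res1, res2 = residuals(r)
```

Both residuals stay at the level of the finite-difference error, whereas changing $E$ away from $\gamma mc^2$ (for instance to $mc^2$) leaves a residual of order unity on this scale.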

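According to (8), (3), and 5⁹⁵, we have, for the normal state,

$$R^a = \alpha r^{\gamma-1}e^{-r/a_0}, \qquad R^b = i\sqrt3\,(1 + \gamma)r^{\gamma-1}e^{-r/a_0}, \tag{9}$$

and we are ready to attend to the angle factors.

The simultaneous eigen-$\psi$'s of $J_z$, $J^2$, and $K$ have either the form $\psi(j, k = j + \tfrac12, m)$ given by 16⁹⁴ or the form $\psi(j, k = -j - \tfrac12, m)$ given by 17⁹⁴. Since the only possible values of $j$ are $\tfrac12$, $\tfrac32$, and so on, and since in the present case $k = -1$, we are here dealing with the case $j = \tfrac12$, $k = -j - \tfrac12 = -1$, and our interest is confined to $\psi(j, k = -j - \tfrac12, m)$. Setting $j = \tfrac12$ and substituting (9) into 17⁹⁴, we get

$$\psi(E = \gamma mc^2,\ j = \tfrac12,\ k = -1,\ m) = \begin{pmatrix} \alpha\sqrt{\tfrac32 - m}\;Y(1,\,m-\tfrac12)\\ -\alpha\sqrt{\tfrac32 + m}\;Y(1,\,m+\tfrac12)\\ i\sqrt3\,(1+\gamma)\sqrt{\tfrac12 + m}\;Y(0,\,m-\tfrac12)\\ i\sqrt3\,(1+\gamma)\sqrt{\tfrac12 - m}\;Y(0,\,m+\tfrac12) \end{pmatrix} r^{\gamma-1}e^{-r/a_0}. \tag{10}$$

⁶ Recall the footnote on page 402.

It remains to determine the suitable values of $m$, whose range is $\pm\tfrac12$, $\pm\tfrac32$, $\pm\tfrac52$, and so on. If $|m|$ is greater than $\tfrac12$, then all four components of (10) vanish; hence the only possibilities are $m = -\tfrac12$ and $m = \tfrac12$. In the first case we get

$$\psi(E = \gamma mc^2,\ j = \tfrac12,\ k = -1,\ m = -\tfrac12) = C_1'\begin{pmatrix} \alpha\sqrt2\;Y(1,-1)\\ -\alpha\,Y(1,0)\\ 0\\ i\sqrt3\,(1+\gamma)\,Y(0,0) \end{pmatrix} r^{\gamma-1}e^{-r/a_0}, \tag{11}$$

where $C_1'$ is the arbitrary numerical constant that we first dropped in (7). Using the table of $Y$'s on page 414 and absorbing the common numerical coefficients of the elements in the arbitrary multiplier, we get, more explicitly,

$$\psi(E = \gamma mc^2,\ j = \tfrac12,\ k = -1,\ m = -\tfrac12) = C_1\begin{pmatrix} \alpha\sin\theta\,e^{-i\varphi}\\ -\alpha\cos\theta\\ 0\\ i(1+\gamma) \end{pmatrix} r^{\gamma-1}e^{-r/a_0}. \tag{12}$$

Similarly, when $m = \tfrac12$ we have

$$\psi(E = \gamma mc^2,\ j = \tfrac12,\ k = -1,\ m = \tfrac12) = C_2'\begin{pmatrix} \alpha\,Y(1,0)\\ -\alpha\sqrt2\;Y(1,1)\\ i\sqrt3\,(1+\gamma)\,Y(0,0)\\ 0 \end{pmatrix} r^{\gamma-1}e^{-r/a_0}, \tag{13}$$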


that is,

$$\psi(E = \gamma mc^2,\ j = \tfrac12,\ k = -1,\ m = \tfrac12) = C_2\begin{pmatrix} \alpha\cos\theta\\ \alpha\sin\theta\,e^{i\varphi}\\ i(1+\gamma)\\ 0 \end{pmatrix} r^{\gamma-1}e^{-r/a_0}, \tag{14}$$

where $C_2$ is another arbitrary constant, independent of $C_1$.

The normal state of the hydrogen atom is thus doubly degenerate, and any nonvanishing linear combination of the $\psi$'s (12) and (14) describes a state for which the energy is certain to have its normal value $\gamma mc^2$.


Exercises 

1. Convert the instantaneous $\psi$'s (11) and (13) into time-dependent $\psi$'s.

2. Show that the states (11) and (13) are mutually orthogonal.

3. Show that for the state (13) av $L_z = \alpha^2\hbar/3(1 + \gamma)$ and av $s_z = \tfrac12\hbar - \alpha^2\hbar/3(1 + \gamma)$, so that electron spin is almost entirely responsible for the $z$-component of the total angular momentum. What are the possible values of $L_z$ for the state (13), and what are their respective probabilities? Make similar computations for the state (11).

4. Show that for the state (13) av $L^2 = \alpha^2\hbar^2/(1 + \gamma)$. What are the possible values of $L^2$ for this state and their respective probabilities? Make similar computations for the state (11). Note that, apart from terms in $\alpha^2$, the value of $L^2$ for these states is 0.

5. For states of greatest interest, the value of $E$ is nearly equal to $mc^2$. Show by inspecting 2⁹⁶ that the absolute value of $\chi_b$ is in this case much greater than that of $\chi_a$; Equations (7) illustrate this result. Accordingly, for states of positive kinetic energy, the components $\psi_1$ and $\psi_2$ of a Dirac $\psi$ are called 'small,' and the components $\psi_3$ and $\psi_4$ 'large,' although, on account of the angle factor, a large component may sometimes turn out to be zero, as illustrated by (12) and (14).

6. Consider the classification of the components of Dirac $\psi$'s into 'large' and 'small' for states of negative kinetic energy having a total energy nearly equal to $-mc^2$.

7. Inspect 16⁹⁴ and 17⁹⁴ and use the result of Exercise 5 to show that the value of $L^2$ for a state $\psi(j, k, m)$ can be taken approximately as $k(k + 1)\hbar^2$; the last remark of Exercise 4 illustrates this result for the case $k = -1$. We can thus say, approximately, that the role of the azimuthal quantum number $l$ is taken over in Dirac's theory by $k$ when $k$ is positive and by $-k - 1$ when $k$ is negative.

8. Contrast the Dirac and the Schroedinger distributions-in-position of the electron of a hydrogen atom in the normal state; but note that the two become identical if in Dirac's distribution we let $c \to \infty$.

9. Set $E$ nearly equal to $mc^2$, so that, except perhaps for very small values of $r$, $E - V + mc^2$ can be replaced by $2mc^2$, and show that then 11⁹⁵ reduces to

P’ + + F(r) - i(k + J R - r((r) ^ 

=«(^ — mc?)R, (16) 

In view of the last remark of Exercise 7, Equation (15) is essentially 
the same as the Pauli radial equation 12“ if $k$ is negative and as the Pauli
radial equation 13®* if $k$ is positive, except for the straightforward
relativistic replacement of $E$ by $E - mc^2$, and the extra term $r\xi(r)R'$, which
has no classical analogue. A closer study* shows that in the case of a
Coulomb field the extra term can be disregarded unless $k = -1$ (that is,
unless $l = 0$ and we have an $s$-state), while for $s$-states it provides just the
correct spin correction; in short, this extra term removes the difficulty
with $s$-states, discussed at the end of §88, that arises in the Pauli theory.

We conclude that in the case of radial fields the Schroedinger theory 
and the Pauli theory can be regarded as successive approximations to the 
Dirac theory; the situation is similar in the case of other fields. 
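
The reduction described in Exercise 9 can be sketched as follows. This is only an outline under assumed conventions that need not match the book's own notation: $g$ and $f$ stand for the large and small radial functions, the coupled radial equations are written in a form common in later textbooks, $k$ carries the sign convention of Exercise 7 ($k = -1$ for $s$-states), and $\xi(r)$ is the usual spin-orbit function. None of these symbols or normalizations is taken from the text.

```latex
% Assumed starting point: a common textbook form of the Dirac radial equations.
\begin{align}
\hbar c\left(\frac{dg}{dr} + \frac{1+k}{r}\,g\right) &= (E - V + mc^2)\,f, \\
\hbar c\left(\frac{df}{dr} + \frac{1-k}{r}\,f\right) &= -(E - V - mc^2)\,g.
\end{align}
% Solve the first equation for f, substitute into the second, keep the term
% that comes from differentiating 1/(E - V + mc^2), and only then replace
% E - V + mc^2 by 2mc^2:
\begin{equation}
-\frac{\hbar^2}{2m}\left[g'' + \frac{2}{r}\,g' - \frac{k(k+1)}{r^2}\,g\right]
+ V g - \tfrac{1}{2}(k+1)\hbar^2\,\xi(r)\,g - \tfrac{1}{2}\hbar^2\, r\,\xi(r)\,g'
= (E - mc^2)\,g,
\qquad
\xi(r) = \frac{1}{2m^2c^2}\,\frac{1}{r}\,\frac{dV}{dr}.
\end{equation}
```

Since $k(k+1) = l(l+1)$ both for $l = k$ and for $l = -k - 1$, the bracket is the familiar radial operator; the term in $(k+1)\xi$ has the form of the spin-orbit energy, and the term in $r\xi g'$ is the extra one with no classical analogue.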


* Condon and Shortley, page 130.



APPENDIX 





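A few formulas involving Legendre functions and spherical harmonics

The functions¹

(6F*)
$$\Theta_l^m(\theta) = (-1)^{\frac{1}{2}(m+|m|)} \sqrt{\frac{(2l+1)\,(l-|m|)!}{2\,(l+|m|)!}}\; P_l^{|m|}(\mu), \tag{1}$$

where $\mu = \cos\theta$, $l = 0, 1, 2, \ldots$, and $m = 0, \pm 1, \pm 2, \ldots, \pm l$, are orthonormal in $\theta$ (see 62^0) and satisfy the equations²

$$\cos\theta\;\Theta_l^m = A_l^m\,\Theta_{l+1}^{m} + B_l^m\,\Theta_{l-1}^{m} \tag{2}$$

$$\sin\theta\;\Theta_l^m = -C_l^m\,\Theta_{l+1}^{m+1} + D_l^m\,\Theta_{l-1}^{m+1} \tag{3}$$

$$\sin\theta\;\Theta_l^m = E_l^m\,\Theta_{l+1}^{m-1} - F_l^m\,\Theta_{l-1}^{m-1} \tag{4}$$

$$\frac{d}{d\theta}\,\Theta_l^m = \tfrac{1}{2} G_l^m\,\Theta_l^{m+1} - \tfrac{1}{2} H_l^m\,\Theta_l^{m-1} \tag{5}$$

$$m \cot\theta\;\Theta_l^m = -\tfrac{1}{2} G_l^m\,\Theta_l^{m+1} - \tfrac{1}{2} H_l^m\,\Theta_l^{m-1}, \tag{6}$$

where the expansion coefficients are

$$A_l^m = \sqrt{\frac{(l+m+1)(l-m+1)}{(2l+1)(2l+3)}}, \qquad B_l^m = \sqrt{\frac{(l+m)(l-m)}{(2l-1)(2l+1)}} \tag{7}$$

$$C_l^m = \sqrt{\frac{(l+m+1)(l+m+2)}{(2l+1)(2l+3)}}, \qquad D_l^m = \sqrt{\frac{(l-m)(l-m-1)}{(2l-1)(2l+1)}} \tag{8}$$

$$E_l^m = \sqrt{\frac{(l-m+1)(l-m+2)}{(2l+1)(2l+3)}}, \qquad F_l^m = \sqrt{\frac{(l+m)(l+m-1)}{(2l-1)(2l+1)}} \tag{9}$$

$$G_l^m = \sqrt{(l-m)(l+m+1)}, \qquad H_l^m = \sqrt{(l+m)(l-m+1)}. \tag{10}$$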

The functions³

(66”)
$$\Upsilon_l^m(\theta, \varphi) = \Theta_l^m(\theta)\,\frac{e^{im\varphi}}{\sqrt{2\pi}} \tag{11}$$

¹ These functions are the conventional normalized Legendre functions
of the first kind, except for the Condon-Shortley choice of the phase factors.

² Condon and Shortley, page 52. Bethe, pages 551-560; note that
Bethe's SJm equals our $(-1)^m \Theta_l^m$.

³ These functions are the conventional normalized complex tesseral
surface spherical harmonics, except for the Condon-Shortley choice of the
phase factors.



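are orthonormal in the angles (see 79’^) and satisfy the equations⁴

$$(L_x + iL_y)\,R\Upsilon_l^m = \hbar\, G_l^m\, R\Upsilon_l^{m+1} \tag{12}$$

$$(L_x - iL_y)\,R\Upsilon_l^m = \hbar\, H_l^m\, R\Upsilon_l^{m-1} \tag{13}$$

$$L_z\,R\Upsilon_l^m = m\hbar\, R\Upsilon_l^m \tag{14}$$

$$L^2\,R\Upsilon_l^m = l(l+1)\hbar^2\, R\Upsilon_l^m \tag{15}$$

$$(p_x + ip_y)\,R\Upsilon_l^m = i\hbar\, C_l^m \left[\frac{dR}{dr} - \frac{l}{r}R\right]\Upsilon_{l+1}^{m+1} - i\hbar\, D_l^m \left[\frac{dR}{dr} + \frac{l+1}{r}R\right]\Upsilon_{l-1}^{m+1} \tag{16}$$

$$(p_x - ip_y)\,R\Upsilon_l^m = -i\hbar\, E_l^m \left[\frac{dR}{dr} - \frac{l}{r}R\right]\Upsilon_{l+1}^{m-1} + i\hbar\, F_l^m \left[\frac{dR}{dr} + \frac{l+1}{r}R\right]\Upsilon_{l-1}^{m-1} \tag{17}$$

$$p_z\,R\Upsilon_l^m = -i\hbar\, A_l^m \left[\frac{dR}{dr} - \frac{l}{r}R\right]\Upsilon_{l+1}^{m} - i\hbar\, B_l^m \left[\frac{dR}{dr} + \frac{l+1}{r}R\right]\Upsilon_{l-1}^{m}, \tag{18}$$

where $R$ is an arbitrary function of $r$ only, where $L_x$, $L^2$, and so on, are
the standard Schroedinger operators, and where $A_l^m$, and so on, are
the coefficients (7) to (10).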


⁴ Bethe, loc. cit.
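
As a rough numerical check of relations (2) and (3) and of the coefficients in (7) and (8), the functions can be tabulated directly. The sketch below assumes that the relations read cos θ Θ(l, m) = A Θ(l+1, m) + B Θ(l-1, m) and sin θ Θ(l, m) = -C Θ(l+1, m+1) + D Θ(l-1, m+1), and that Θ carries the phase factor (-1) to the power (m + |m|)/2. The function names (`assoc_legendre`, `theta`, `max_residual`) are illustrative and do not come from the text.

```python
import math


def assoc_legendre(l, m, x):
    """Conventional P_l^m(x) for 0 <= m <= l, without any (-1)^m phase factor."""
    s = math.sqrt(max(0.0, 1.0 - x * x))
    pmm = 1.0
    for i in range(1, m + 1):          # P_m^m = (2m-1)!! (1 - x^2)^(m/2)
        pmm *= (2 * i - 1) * s
    if l == m:
        return pmm
    pm1 = x * (2 * m + 1) * pmm        # P_{m+1}^m
    for ll in range(m + 2, l + 1):     # upward recurrence in the degree
        pmm, pm1 = pm1, (x * (2 * ll - 1) * pm1 - (ll + m - 1) * pmm) / (ll - m)
    return pm1


def theta(l, m, angle):
    """Normalized Theta(l, m) with the assumed phase (-1)^((m+|m|)/2); 0 if |m| > l."""
    if l < 0 or abs(m) > l:
        return 0.0
    am = abs(m)
    norm = math.sqrt((2 * l + 1) / 2 * math.factorial(l - am) / math.factorial(l + am))
    phase = -1.0 if (m > 0 and m % 2 == 1) else 1.0
    return phase * norm * assoc_legendre(l, am, math.cos(angle))


def ratio_sqrt(num, den):
    """sqrt(num/den), returning 0 when the numerator vanishes (avoids 0/negative)."""
    return 0.0 if num == 0 else math.sqrt(num / den)


def A(l, m): return ratio_sqrt((l + m + 1) * (l - m + 1), (2 * l + 1) * (2 * l + 3))
def B(l, m): return ratio_sqrt((l + m) * (l - m), (2 * l - 1) * (2 * l + 1))
def C(l, m): return ratio_sqrt((l + m + 1) * (l + m + 2), (2 * l + 1) * (2 * l + 3))
def D(l, m): return ratio_sqrt((l - m) * (l - m - 1), (2 * l - 1) * (2 * l + 1))


def max_residual(lmax=4, angles=(0.3, 1.1, 2.0, 2.9)):
    """Largest absolute mismatch in the assumed relations (2) and (3)."""
    worst = 0.0
    for l in range(lmax + 1):
        for m in range(-l, l + 1):
            for t in angles:
                r2 = math.cos(t) * theta(l, m, t) - (
                    A(l, m) * theta(l + 1, m, t) + B(l, m) * theta(l - 1, m, t))
                r3 = math.sin(t) * theta(l, m, t) - (
                    -C(l, m) * theta(l + 1, m + 1, t) + D(l, m) * theta(l - 1, m + 1, t))
                worst = max(worst, abs(r2), abs(r3))
    return worst


if __name__ == "__main__":
    print(max_residual())  # should be at the level of rounding error
```

Relation (3) is the more sensitive test, because it links functions with different m and so depends on the choice of phase factors, whereas relation (2) would hold for any m-dependent phase.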



INDEX 





A 

Alpha particle, 227 
Amplitude, probability, 92, 443 
Analogue, classical, 76 
Anderson, C., 525 
Angle between functions, 329 
Angular momentum: 
orbital, 417, 435, 437, 488, 490, 513 
possible values of, 479 
spin, 477, 482, 490, 513, 514 
total, 489, 490, 514 
Anticommutation, 269 
Approximations: 
in powers of A, 186 
numerical, 180, 185 
perturbation, 198, 368, 374 
quantization, 191 
transmission coefficient, 221 
variational, 193 
Argand diagram, 298 
Associative law, 292 
Assumptions, 73, 74, 77, 86, 89, 90, 
164, 279, 280, 354, 389, 442 
av, 54, 56, 104, 420, 447 n.; av^, 90; 
av«, 60 

Average, 54, 420 

B 

Band, energy, 269 
Barnett, S. J., 447 n. 

Barrier, potential, 219 
rectangular, 157, 211 
single-step, 217 
Bateman, H., 181 n. 

Bauer’s formula, 456 n. 

Bennett, A. A., 181 n. 

Bessel function, 453
Bethe, H., 460 n., 532 n., 533 n.
Binding, 173 
conditional, 178 
virtual, 222 
Bloch, F., 269 n. 

Bloch’s theorem, 269, 439 
Bohr, N., 1, 164 n., 461 
Bohr: 

frequency condition, 164 
hydrogen levels, 461, 470 
radius, 461 


unit, 437 

Born, M., 220, 230 n. 

Bound: See Binding 
Boundary conditions, 11, 241 
Box, particle in, 50, 157, 244 
Breit, G., 251 n., 511 n.

Brillouin, L., 186 n. 
de Broglie, L., 1, 230 
de Broglie wave, 236, 448 
B. W. K. method, 186 n., 188 

C 

Campbell, G. A., and Foster, R. M.,
138 n.

Canonical conjugates and equations, 45, 410

Central field and motion, 426, 448 
degeneracy in, 451 
Characteristic function, number, 
and value, 14 n. 

Cofactor, 324 

Coherent superposition, 118 
Column symbol(s):
addition (sum) of, 303, 312 
definition of, 302, 311 
equality of, 312 
multiplication of, 312, 313, 382 
Combination, linear, 34 n. 
Commutator, 7, 326 
Composition of states, 106 
Compton, A. H., 238 
Compton effect, 124 
Concave function, 16 
Condition(s):
boundary, 11, 241 
continuity, 11, 401 
Dirichlet, 34 n. 
frequency, 164 
maximal, 52 
quantum, 74 

Condon, E. U., 220. See also 
Gurney and Condon 
Condon, E. U., and Morse, P. M., 
104 n. 

Condon, E. U., and Shortley, G. H., 
27 n., 165 n., 373, 412 n., 470 n., 
479 n., 499 n., 529 n., 532 n. 


537 



538 


INDEX 


Configuration space, 90, 442, 468 n. 
Conservative field, 43, 418 
Constant(s):
fine-structure, 471 
of motion, 44, 46, 257, 266, 426 
commuting, 260 
distribution in, 268 
Planck’s, 74 
Schroedinger’s, 80 
Sommerfeld’s, 471 
Continuity: 
conditions, 11, 401 
equation, 209, 423 
Convex function, 16 
Correspondence principle and rule, 
362 

Courant, R., and Hilbert, D., 40 n. 
Current: See Probability current 

D 

d, 39, 403 

Darwin, C. G., 146 n., 484 n., 501 n., 
520 n. 

Davisson, C. J., 238 
Degeneracy: 
in central field, 451 
in Coulomb field, 463 
of eigenvalue, 31, 288, 318 
degree of, 31 
of matrix, 319 
removal of, 205, 373 
del, 405 

Delta (inexactitude), 54, 56, 60, 420 
Delta-function, 68 
det, 323 
Determinant: 
of matrix, 323 
secular, 323 

Deviation, standard, 54 n. 

Diagonal and diagonalization: See 
Matrix 

Difference equation, 283 
Dirac, P. A. M., 1, 69, 74, 163 n.,
260, 391, 393, 398, 506, 520 n.,
521 n., 525
Dirac equation, 510 
Dirichlet conditions, 34 n. 
Dispersion, 54 n. 

Dispersive propagation, 232 
Distribution (function), 52, 54, 56, 
61, 419 

amplitude of, 92, 137 
continuous, 65, 420 
discrete, 63 


Gaussian, 60 
in energy, 111, 444 
in r, 422 
in x, 92

in xyz, 420, 443 
node of, 104 
normal, 60 
relative, 67 
stationary, 68, 421 
Doppler effect, 167

Dual space, 384 
Dwight, H. B., 191 n. 

E 

Ehrenfest, P., 249 n. 
Eigenfunction, 14, 169 
simultaneous, 29 
Eigen-psi, 393, 487 
Eigenray, 288, 316 
Eigenstate, 104, 260 
simultaneous, 260 
Eigenvalue: 

degenerate, 31, 288, 318 
multiplicity of, 31, 318 
of equation, 169 
of mapping, 287 
of matrix, 315, 321 
of operator, 14, 406 
physical significance of, 73 
spectrum, 15, 319 
Eigenvector, 287, 315 
simultaneous, 326 
Eigenwert, 14 n. 

Einstein, A., 1 
Energy, 44, 418 
band, 269 

distribution, 111, 444
level, 84 
virtual, 226 
spectrum, 84 
state, 103 

Even function, 18 n. 
exp, 17 n. 

Expansion: 

coefficient, 34, 404, 441 
Fourier, 34, 39, 40 
theorem, 260 
uniqueness of, 35 
Expectation, 53, 54 
Expectation formula: 

Dirac, 509

Heisenberg, 388, 389 





of p-method, 280 
Pauli, 488 

Schroedinger, 90, 93, 443 
symbolic, 398 

F 

Fermi, E., 163 n.

Floquet's theory, 269 n.

Foster, R. M.: See Campbell and 
Foster 
Fourier: 
coefficient, 34 n.
expansion, 34, 39, 40 
integral, 132 
inversion of, 143 
series, 34, 39, 40 
transform, 143 
Frame change, 360 
unitary, 364 

Frank, N. H., and Young, L. A.,
220 n. See also Slater and 
Frank 

Free particle, 47, 144, 430, 452 
Frenkel, J., 496 
Frequency: 
de Broglie, 216 
condition, 164 
of wave, 231, 426 
Function space, 327 

G 

Gamow, G., 228 n. 

Gaussian distribution, 60 
Geiger-Nuttall law, 2^ • 
Generating function, 25 
Germer, L. H., 238
Good behavior of function, 11, 278,
402, 486, 511

Goudsmit, S.: See Uhlenbeck and 
Goudsmit 

Gram-Charlier series, 40 
Group, 261, 292 
velocity, 233, 237 

Gurney, R. W., and Condon, E. U.,
228 n.

H 

ħ, h, 74

de Haan, D. Bierens, 150 n. 
Hamiltonian, 44, 144, 418 
Hamilton's equations, 45, 419 
Heisenberg, W., 1, 120 n., 164 n.
Heisenberg:

assumptions, 354 
expectation formula, 388, 389 
inequalities, 121 
matrices, 341, 347 
method, 353 
principle, 122 
Heitler, W., 163 n. 

Hermite, C., 23 n.

Hermite: 
functions, 24 
polynomials, 23 
Hermitian: 
matrix, 326
operator, 253, 400 

Hilbert, D.: See Courant and 
Hilbert 

Hilbert space, 327 n. 

Hill, E. L., and Landshoff, R.,
511 n.

Hydrogen levels: 

Bohr, 467, 465 
Dirac, 521

relativity corrections for, 472, 501 
Sommerfeld, 471 
spin corrections for, 498, 501
Hydrogenic, 464 n. 

Hydrogen-like, 464 n. 

I 

Idemfactor, 3
Ill-behaved function, 12
Im, 32 

Ince, E. L., 173 n., 269 n., 282 n.
Incoherent superposition, 117 
Indefiniteness, 54 n. 

Indeterminacy, 54 n. 

Induction, mathematical, 23 n. 
Inexactitude, 54, 60, 420
Integrable square, 37, 403 
Integration: 
by parts, 40 n. 
two-way, 181 
Inversion: 
in origin, 290 
of integral, 143 

J 

Jackson, D., 327 n. 

Jeans, J. H., 69 n., 407 n., 411 n. 
Jeffreys, H., 186 n. 

Johnson, R. P.: See Seitz and
Johnson 





Jordan, P., 1 

J. W. B. K. method, 186 n. 

K 

Kappa, 80 

Kemble, E. C., 14 n., 73 n., 187 n.,
193 n., 327 n., 402 n., 448 n.,
496 n.

Kennard, E. H., 127 n. 

Kennard packet, 127 
Kramers, H. A., 186 n., 275 n. 
Kronig, R. de L., and Penney, W. 
G., 275 n. 

L 

Laguerre polynomial, 461 
Landshoff, R.: See Hill and Landshoff

Laplacian, 405 

Lattice, potential, 268, 269, 439 
Legendre:
equation, 408 
function, 411, 532 
operator, 407 
polynomial, 412 
Legendrian, 407 
Length: 

of function, 329, 330 
of vector, 293, 299, 312 
Level: See Energy
Life, mean, 167 
Limit, classical, 49, 96 
Linear momentum, 44, 78, 130, 417, 
427, 429 

M 

MacColl, L. A., 212 
Mapping(s): 
addition (sum) of, 290 
axes of, 2^ 
consecutive, 299 
eigenvalue of, 287 
identical, 290 
inversion, 290 
linear, 287 

multiplication (product) of, 290 
of function space, 331
of plane, 286 
operator, 287 
reciprocal of, 291 
representation of, 293
singular, 287 
unambiguous, 287 


Mass energy, 473 
Matrix, matrices: 
addition (sum) of, 302, 310 
algebra, 309-320, 322-327, 360- 
374, 380-385 
anticommuting, 326 
as operator, 315 
commuting, 326 
commutator of, 326 
complex conjugate of, 325 
component of, 309 
definition of, 309 
degenerate, 319 
determinant of, 323 
secular, 323 
diagonal, 322 

diagonalization of, 360, 366 
approximate, 368 
eigenvalue of, 315, 321 
eigenvector of, 315 
element of, 309 
equality of, 309 
Heisenberg, 341-353 
Hermitian, 326, 383 
multiplication of, 300, 305, 310, 
311, 382 

perturbation method, 368, 374 
product of, 300, 310 
reciprocal of, 324 
secular: 

determinant of, 323 
equation of, 316 
singular, 323 
time average of, 380 
time derivative of, 349 
trace of, 322 
transform of, 363 
transpose of, 325 
unit, 322 
unitary, 364 
zero, 322 
Maximal set, 52 

Milne, W. E., 168 n., 181 n., 186 n.
Milne’s method, 185 
Modulus, 33 
Moment: 

magnetic, 477, 478 
of momentum, 417 
Momentum (See also angular, linear, and radial momentum):
method, 277-284 
state, 131 

Morse, P. M., 276 n.: See also 
Condon and Morse 





Motion: 
central, 448 n. 

constants of, 44, 257, 258, 260,
266, 425

equation of, 255, 429 
Multiplication, associative, 292 
Multiplicity of eigenvalue, 31, 318 

N 

N-dimensional space, 320
von Neumann, J., 1, 253 n. 

Node, 26, 104 
Normalizable function, 37 
Normalization, 37, 398, 403, 485 
Normalizing factor, 38, 403 
N-space, 321
N-vector, 311

O 

Odd function, 18 n. 

Operand, 2 
Dirac, 508 
Heisenberg, 386 
of mapping, 287 
of matrix, 315 
of p-method, 277 
Pauli, 482 
rule, 89 

Schroedinger, 77, 86, 428
symbolic, 391 
Operator(s), 2
addition (sum) of, 5 
algebra, 5 
anticommuting, 269 
assumption, 73 
commutator of, 7 
commuting, 7 

differential, 6, 334, 404, 485 n. 

representation of, 334 
Dirac, 509 

eigenvalue of, 14, 73, 406 
equality of, 5

Hamiltonian, 144 
Heisenberg, 342 
Hermitian, 253, 254, 400
Laplace, 405 
Legendre, 407 
linear, 8 
momentum, 130 

multiplication (product) of, 6, 7 
of p-method, 278, 279 
Pauli, 482 
real, 253 



Schroedinger, 77, 427, 467, 468
self-adjoint, 253 
spectrum of, 15 
symbolic, 391 
unit, 3 
zero, 3 
Orthogonal: 
functions, 33, 329, 404 
sets, 33, 404, 407
vectors, 293, 299, 312 
Orthonormal: 
sets, 37, 404, 416 
vectors, 293 
Oscillator: 
anharmonic, 205 
anisotropic, 438 
isotropic, 426, 433 
levels of, 81, 281, 355, 393 
line spectrum of, 165 
linear harmonic: 
classical, 47, 61, 62, 69 
energy states of, 93 
Schroedinger functions for, 87
packets for, 69, 70, 112 

P 

Packet: 

Kennard, 127
MacColl, 212
probability, 65, 112
narrow, 249
reflection of, 212 
wave, 234 
Particle, 237 
Pauli, W., 478, 484 n.

Pauli operands and operators, 482 
Pauling, L., and Wilson, E. B., 
24 n., 193 n., 460 n. 

Peirce, B. O., 191 n.

Penney, W. G.: See Kronig and 
Penney 

Perturbation methods, 198, 368, 374
Phase, 38, 41, 348

arbitrariness of, 38, 348 
initial, 231 

of complex number, 33 
of motion, 49 
of wave, 231
relative, 117 
velocity, 232 





Phi:
Heisenberg, 387
symbolic, 397
Planck, M., 1
Planck’s constant, 74 
p-method, 277-284 
Poisson bracket, 45, 419 
Positron, 525

Potential (function), 43, 418 
p-representation, 2^ ; 

Probability (See also Distribution) : 
amplitude, 92, 443 
conservation of, 209, 423 
current, 207, 210 
density, 4^, 446 
reflection of, 214 
density, 420 
of transition, 164, 349
packet, 65, 112, 421 
narrow, 249 
reflection of, 212 
Propagation:
constant, 231 
vector, 425 

Proper function, number, and value, 
14 n. 

Psi, psi-function, 77, 188, 279, 386,
392, 428, 484, 508 
basic polar, 449, 451 
instantaneous, 130 
state, 92 

Q 

Quadratic integrability, 37, 403 
Quantization, 84 
approximate, 191 
Cartesian, 430 

integer and odd half-integer, 192 
Quantum: 

condition, 74
number, 440 
axial, equatorial, and magnetic, 437
azimuthal, 438 
inner, 494 
principal, 451

R 

Radial:
equation(s) : 

Dirac, 518 
Pauli, 498 
Scbroedinger, 450 


field, 426, 448 
degeneracy in, 451 
momentum, 426, 448 
Radiative transition, 162 
Radioactivity, 227 
Rasetti, F., 227 
Ray, 285, 298, 313
Re, 32 

Recursion formula, 26 
Reflection coefficient, 214
Relativistic quantum mechanics,
505-529

Relativity corrections, 472 
Representation:
of differential operators, 334 
of mappings, 293 
of vectors, 302 
Resolution of states, 111
Rest energy, 472 
Rolle's theorem, 26 n.

Row symbol(s), 380
addition and equality of, 381 
multiplication of, 381, 382 
Rutherford, E., 227 

S

Scarborough, J., 181 n.
Schroedinger, E., 1, 81, 230
Schroedinger: 
constant, 80 
equation: 

first, 81, 168-179, 429 
second, 86 

expectation formula, 90, 93, 442, 
443 

function, 77, 86, 92, 428, 443 
operators, 77, 427, 467, 468 
representation, 72 
Secular:

determinant, 323 
equation, 316 

Seitz, F., and Johnson, R. P.,
276 n. 

Selection rule, 166 
Separation of variables, 242 
Set: 

complete, 35 
maximal, 52 
normal, normalized, 37 
orthogonal, 33, 404 
orthonormal, 37, 404 
unitary, 37 





Shortley, G. H., 468 n.: See also
Condon and Shortley 
Similarity transformation, 363 
Single-valued function, 401
Slater, J. C., and Frank, N. H., 
187 n., 460 n. 

Sommerfeld, A., 20 n., 192, 407 n., 
471, 476 n. 

Space function, 503 
Specification of states, 89, 386, 442 
Spectrum:
eigenvalue, 15, 319
energy, 84 

line, of oscillator, 166 
term, 84 

Spherical harmonic, 414, 532
Spin: 

angular momentum, 477 
existence of, 513
precession of, 490 
corrections, 498 
function, 503 

magnetic moment, 477, 514 
matrices, 484, 512
Spin-orbit interaction, 496 
Spur, 322 n. 

Stark effect, 206 
State(s):

composition of, 106 
definition of, 69 
energy, 103 
excited, 103 
mean life of, 167 
momentum, 131 
normal, 94, 462, 525 
psi, 92 

quantum, 104 
resolution of, 111
s, p, d, f, 451

specification of, 89, 386, 442 
stationary, 101 
superposition of, 109, 117 
Stern-Gerlach experiment, 477 
String, 239 

Sturm-Liouville problem, 16 
Sturm’s theorem, 173 
Superposition: 
of states: 
coherent, 118, 140 
incoherent, 117 
of waves, 232 
Symbol: 
column, 302, 311 
row, 380 


Symbolic method, 391-400 
Symbols:

-, 32, 325, 380; 1 I, 33; ’, 41; , 43, 
251; ( >, 74; (1), 141, 142; 
299; 326; 344; (I 1), 346; 

0 , 436; ^ and approxi- 
mate equality 


T 

Taylor series, 200 n. 

Term spectrum, 84 
Theta, 412, 532
Thomas, L. H., 496
Time: 

dependence:
explicit, 46 

of Heisenberg matrices, 348, 354 
of psi, 86, 280, 387, 441, 488, 509
derivative: 
of average, 247
of dynamical variable, 250, 251 
of matrix, 349 
factor, 87, 348 
tr, 322 
Trace, 322 
Transform:

Fourier, 143 
matrix, 363 

Transformation, similarity, 363 
unitary, 366 

Transition probability, 164, 349

Transmission:
coefficient, 214 
approximate, 221 
selective, 221 
Transpose of matrix, 326 
Tunnel effect, 214^ 

Two-way integration, 181 


U

Uhlenbeck, G. E., and Goudsmit,
S., 477

Uncertainty, 54 n.

Uniqueness:
of distribution, 68 
of expansion, 35 
Unitary: 

frame change, 364 
function, 37, 403 
matrix, 364, 365 
set, 37 

transformation, 366 
vector, 293 





Upsilon, 414, 532

Uspensky, J. V., 59 n., 60 n., 100 n. 

V 

Van Vleck, J. H., 1 n., 74 n. 

Variation method, 193 

Vector(s):

addition (sum) of, 286, 312 
as operands, 316 
complex, 297, 321 
complex conjugate of, 381 
component of, 293, 297, 311 
definition of, 286, 297, 311, 321 
Heisenberg, 386 
length of, 293, 299, 312, 330 
linearly dependent, 286 
multiplication of, 286, 303, 312, 
313, 382 

orthogonal, 293, 299, 312 
orthonormal, 293 
propagation, 426 
representation of, 302 
unitary, 293 

Velocity: 
group, 233, 237 
phase, 232 

Virtual:
binding, 222 
level, 225 

W 

Wave(s):

de Broglie, 236, 448 


complex, 236, 425 
composite, 232 
equation, Schroedinger, 429 
function, 77 
length, 231, 236, 426 
modulated, 233, 234 
monochromatic, 231, 236, 426 
number, 231, 425 
packet, 234 
superposition of, 232 

W. B. K. method, 186 n., 188

Well-behaved function, 11, 278, 402,
486, 511

Well, potential, 60, 160 

Wentzel, G., 186 n. 

Wilson, E. B.: See Pauling and 
Wilson 

Wilson-Sommerfeld rule, 192

X

xyz, 401, 420 

Y 

Young, L. A.: See Frank and Young

Z 

Zeeman effect, 205

Zero:

matrix, 322 
operator, 3 



